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ABSTRACT 


Writing has always been natural mode of collecting, storing and 
transmitting information down the ages. Inscriptions are the earliest form of 
writing, and have defined human culture down through the centuries through 
material remains. They hold the key to information on the lives of our 
ancestors, and trace human survival in ancient societies. Different forms of 
inscriptions have been found through sources such as potsherds, copper 
plates, rock beds, palm leaf manuscripts, and walls encasing monuments. Of 
these, stone inscription have drawn remarkable attention in recent years from 
researchers and the government, chiefly because they are authentic sources of 
information on history, culture, social welfare, religious tolerance, civil 
administration and medicine. These inscriptions, found in a typically 
conventional form, are to be deciphered and translated into a contemporary 


language form in order for them to be read and their contents understood. 


Stone inscriptions have been found in several languages, including 
kannada, Malayalam, Urdu, Sanskrit, and steps taken to decipher them 
digitally. Compared to these languages, it must be noted that Tamil has 
possibly the largest quantum of inscriptions, holding a wealth of information. 
No digital system for Tamil. however, has been proposed so far. According to 
the guidelines laid down by the Department of Archeology and the Tamil 
Virtual Academy, Government of Tamil Nadu, India, a standardized Tamil 
alphabet came into existence in the 11" century, during the reign of King Raja 
Raja Chola. Plenty of inscriptions are available for the period in question. 
Hence the proposed work concentrates on Tamil stone inscriptions of the 11" 
century. The traditional method of estampaging contents from stone 
inscriptions is cumbersome and time- consuming, and so far, only 20% of all 


inscriptions have been acquired by the Department of Archaeology, 
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Government of Tamil Nadu. Owing to natural hazards and site work, a 
number of inscriptions have either deteriorated or been destroyed. There is, 
therefore a need to develop a system that speeds up the process digitally. Such 
a system facilitates public interaction in the form of a shared cultural heritage, 
backed by authentic information from a rich legacy of ancient stone 


inscriptions. 


In today’s fast-paced world with its incredible technology, 
character recognition is a demanding realm for researchers. In recent times, 
integrating human intelligence into computer systems has become a key 
research problem. However, it was character recognition that provided the 
impetus for pattern recognition and image analysis in science and 
engineering, and now facilitates communication between humans as well as 
that between humans and machines. As far as character recognition is 
concerned, the maximum work reported is in languages such as Chinese, 
Arabic, Thai, Hoysala (Kannada), Devanagari (Hindi), Malayalam, Marathi 
and English. Several techniques have been developed for handwritten and 


document images. 


The proposed work is twofold, and incorporates: 1. Digital 
acquisition, and 2. Character recognition and transliteration. A total of 30,210 
characters from 250 stone images, captured from the Tanjore Brihadeshwara 
Temple using a DSLR camera. An intelligent system that recognizes 
characters from Stone Inscription images is a challenge. This is because, there 
is little analytical understanding of the factors impacting a range of variations, 
and encompass spaces between characters, illuminations, stone color, hard- 


surface engraving, and cracked stones. 


Once the acquisition is done, the primary challenge handled in this 
work is to extracting foreground characters from the background, because the 


difference is not global. A sequence of methods is used to clean the image. 


The first step is the point processing method termed Linear Contrast 
Adjustment (LCA). Next, to completely eliminate background, binarization is 
carried out using a technique called the Modified Fuzzy Entropy-based 
Adaptive Thresholding (MFEAT) to separate the inscribed characters with the 


Iterative Bilateral Filter (IBF) to preserve edges and de-noise the Image. 


Dissecting characters from a lengthy script into individual 
characters is a critical area of research, since they vary with respect to strokes, 
height, width, structure and content. The segmentation method proposed 
follows three major steps: Horizontal Baseline Segmentation (HBS) to 
segment lines, Vertical Character Segmentation (VCS) to segment lines into 
single characters, and feature-based segmentation by means of line and curve 
extraction using the Progressive Probabilistic Hough Transform (PPHT). Line 
and character segmentation are done by finding adaptive thresholding from 


the histogram plot. 


The segmented image is to be classified. Characters are trained and 
tested using machine learning models like the pattern matching algorithm, 
Feature extraction and classification are carried out using Artificial Neural 
Network (ANN) and a deep learning model called the Convolutional Neural 
Network (CNN). The Image-based Character Pattern Identification (CPI) 
system is developed using the Modified Speeded-Up Robust Feature with 
Bag-of-Graphemes (MSURF-BOG) algorithm. Textual data is extracted from 
the character and matched with the data trained already. The overall accuracy 
obtained using the pattern matching algorithm is 72%, though it does take 
time to search for and locate the character. Another classification model, 
ANN, is developed. Qualitative and quantitative features are extracted prior to 
their being given to the classifier. These features are then trained to get the 
recognized character. The accuracy achieved with this method is 80%, though 
a degree of confusion is occasioned with similar kinds of characters surfacing, 


as a result of which certain characters are misclassified. 
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In order to improve accuracy and feature extraction overall, deep 
learning using the CNN is undertaken in this research. The CNN is trained to 
acquire a knowledge of the character through unsupervised learning. Neurons 
have 3-dimensional information in terms of height, width and depth. 
Convolution is applied to the input layer before the next layer is worked on. 
These layers have a set of filters called kernels, and computing the dot 
product on the filter entries provides an activation map. As a result, the 
network learns from the filter; detects character-specific features for each 
character, and achieves a higher recognition rate and accuracy at 90%. 
Finally, these ancient characters are identified and mapped to the 
contemporary Tamil character Unicode. This system paves the way for 
automatic inscription transliteration and also preserves information in a digital 


format. 


Despite the challenges faced by the proposed work in terms of 
dealing with greatly-deteriorated stone inscriptions against a poor 
background, the system successfully demonstrates higher accuracy and 
novelty. In future, misclassifications can be fixed by implementing character 
reconstruction techniques before preprocessing. Going forward, accuracy in 
transliterating compound characters and vowels can be improved by training 


the deep learning model with a much larger character dataset. 


Thus, the proposed system is designed, implemented and evaluated 
for its performance. The results observed are verified and validated by 
epigraphy experts for their correctness. Testimonials of the validation are 


enclosed as an annexure. 
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CHAPTER 1 


INTRODUCTION 


1.1 INTRODUCTION 


Developing a digital system to perform an operation on digital 
image is called digital image processing. It plays an active role in obtaining 
patterns and features from digital images. This introductory chapter discusses 
inscriptions as the writing system practiced in ancient times and followed for 
a while down the centuries, and often viewed as a rich repository of 
knowledge. It explains the motivation underlying this research, particularly as 
it applies to 11" century inscriptions, and the need for a proposed system such 
as this to deal with the problems involved. A novel system is designed for 
recognizing characters from stone inscription images. Character recognition is 
a challenge in inscription images, because the system must be trained to 
predict the characters as such humans would is the objective behind this 


research. 


1.2 INSCRIPTIONS — AN OVERVIEW 


Inscriptions refer to the ancient practice of engraving information 
on stones, copper plates, pottery, monuments and temple walls. Stone 
inscriptions have been found in substantial numbers, holding huge volumes of 
authentic information from the ancient to the medieval period, beginning from 
3rd Millennium BCE. These inscriptions offer insights into the lives of early 
monarchs, and details their activities, dates on which particular events 


occurred, genealogies, religious practices, political ties and administration, 
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land ownership, and the taxation system all of which depict a well-structured 
bureaucracy. To preserve and study inscriptions, the Archaeology Survey of 


India launched Epigraphy Branch in 1966. An 11" century character chart 


standardized by the Archeology Department is shown in Figure 1.1. 


Figure 1.1 11" century character chart from Epigraphy Branch 
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Presently, epigraphical efforts are focused on locating places where 
inscriptions are found; labor-intensive copying called estampaging; and the 
manual transliteration and scanning of estampage that is indecipherable. In 
this digital era, the manual copying of inscriptions, known as wax rubbing, is 
time- consuming. Further, the manpower required for transliteration is very 
scarce, leading to a great loss of information. Sample inscriptions captured 


from Tanjore’s Brihadeshwara Temple are shown in Figure 1.2 (a) & (b). 
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(b) 
Figure 1.2 Sample 11" century stone inscription images 


With technological advances, the digital acquisition and recognition 


of characters from inscription images has provided longevity, integrity and 
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public access. The Optical Character Recognition (OCR) system has been 


developed for handwritten and scanned document images for international 


languages like Arabic, Chinese, Thai, and English, as well as for national 


languages like Hoysala, Bangla, Marathi, Telugu, and Tamil. However, 


However, very few studies have been carried out on stone inscription images. 


1.3 MOTIVATION 


The motivation behind taking up this research on stone inscription 


image includes the following: 


Inscriptions contain a veritable mine of information about the 


past especially its history culture. 


Huge volume of stone inscriptions are at hand but are not 


entirely accessible. 


Traditional methods of copying inscriptions that are still 
practiced are laborious. Further, the quality of imprints is too 


poor to be digitized. 


Frequent estampaging causes rapid fading of the inscriptions, 


given the intense physical activity involved. 


Eventually, with the passage of time, they deteriorate to a point 


where they are rendered undecipherable. 


Only 20 % of all inscriptions have been copied over the last 70 


years. 
Very few scholars are available for manual transliteration. 


Estampages are poorly stored and deteriorate quickly, owing to 


factors like natural hazards. 
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e Compared to other languages, the work done on Tamil 


inscriptions is rather limited. 


Digitization and storage of inscriptions helps epigraphists and 
researchers access and interpret information for knowledge retrieval. 
Consequently, many researchers are presently working in_ character 
recognition from stone inscriptions, though with no inscription reader 
software available to date. Inscriptions have been found in a slew of 
languages, and their volume is represented pictorially in Figure 1.3 
(http://www.tnarch.gov.in/epi.htm). However, the maximum number of 
inscriptions were found in Tamil rather than in languages like kannada, 


Sanskrit, Telugu, Parkrit and Marathi. 


Volume of Inscriptions 


= Tamil 

= Kannada 
Sanskrit 
= Telugu 
= Parkrit 

= marathi 


@ Others 


Figure 1.3 The volume of inscriptions in various languages 


(tnarch.gov.in) 


The progress made in terms of manually copying and publishing 
inscriptions by means of epigraphy is shown in Figure 1.4. So far, only 
12,266 inscriptions have been copied through estampaging and preserved in 
document format. The deciphered inscriptions are around 8,337, of which the 


published ones are 2,590 and the yet-to-be deciphered ones at 3,929. 
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Approximately 9,676 are yet to be published, and documents preserved in 
paper format tends to deteriorate rapidly. deterioration. A report provided by 
the Archaeology Survey of India estimates that around 10,000 inscriptions 
have yet to be estampaged, given the tedium involved in the traditional way of 


copying inscriptions. 


Inscriptions copied by Epigraphy 
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3,929 


Works progressed 


Figure 1.4 Inscriptions copied by Epigraphy Wing (tnarch.gov.in) 


1.4 PROBLEM SPECIFICATION 


Stone inscriptions are found to be authenticate source materials 
with a wealth of information in them. The Department of Archeology, 
Government of Tamil Nadu, has had these inscriptions copied manually, 
using a technique termed wax rubbing. way called wax rubbing. In line with 
the statistics provided by the Archeological Survey of India, only 20% of all 
inscriptions ve been published in over 70 years. of duration. Accelerated 
deterioration is brought on by an array of environmental factors like bad 
storage conditions, paper decay, and seeping ink. A modernized approach 
looks to copy these inscriptions swiftly and robustly. A Digital copy of the 


inscriptions makes it easily accessible to the research community. The 
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challenges associated with processing and recognizing characters from stone 


inscriptions are: 


e Variations in stone colors, and uncontrolled illumination 

e Low-contrast differences that makes it difficult to differentiate 
between the foreground character and the background. 

e Huge volume of character sets. 

e Fast deterioration of both characters and stone 

e Organizational restrictions at heritage sites on using imaging 


devices. 


Recognizing characters from stone inscription images is complex. 
Figure 1.5 shows the importance of this research, given the issues involved 
in following the traditional method that results in deterioration and the 
disproportionate quantum of time involved. Advancements in technology 
have facilitated the quick recognition of characters as well as digital data 
preservation. The proposed work attempts to derive solutions for the 


problems to be dealt with. derives the solution for the problems faced so far. 


Huge volume of data 


Issues 

Traditional method of | Limited scholars for Time Deterioration of inscription 
inscription acquisition manual transliteration consuming due to natural hazardous 
Solutions 


Digital Automated Reduced time Digitized 
acquisition transliteration preservation 
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In the field of character recognition, various types of images and documents 
(both handwritten and typewritten) have been handled, as shown in Figure 
1.6. Studies on inscription images are so few as to be negligible, in terms of 


the number. The works on proposed inscription images is very less and 


countable. 
Character recognition 
Images 
handled 
Handwritten Document Deteriorated Scene text 
scanned image image document image image 


Proposed image 
in this research 


Figure 1.6 Character recognition — Images handled and the proposed 
inscription image 


1.5 AIM AND OBJECTIVE 


The aim of the proposed system is to develop an automated 
character transliteration system that helps to extract relevant scripts with 
specific information, intended for researchers interested in knowledge 
processing. This system will provide an amazing means for the collection of 
ancient data and pave the way to extract thematic information from wisdom of 


ages. 
The objectives of the research are 


e To develop an efficient model for digital image acquisition of 


stone inscriptions. 
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e To enhance inscription image quality by devising efficient 


image enhancement techniques. 


e To develop a machine learning system to recognize characters 


from stone inscription images with a high accuracy rate. 


e To transliterate recognized characters into contemporary Tamil 


characters. 


1.6 HYPOTHESIS 


With the passage of time, there is every possibility of losing the 
wealth of inscriptions for all time. So then, it is imperative to take measures to 
stockpile this antique information. The conventional method, where 
epigraphists take imprints of stone engravings with wet paper squeezes, wax 
rubbings and scale drawings is both time-consuming and strenuous. The 
proposed digital photography method is a comparatively easy and fast 
approach to capturing stone inscriptions straight in digital form. It eliminates 
the overhead of intermediate paperwork, and of scanning inscriptions into a 


computer for digital storage. 


No technology so far been developed to conserve ancient stone 
inscriptions in an interpretable digital form that is accessible to the general 
public specifically, the research community looking for an understanding of 
the past, as well as amateur archaeologists or epigraphers. The deliverables of 
this research constitute the recognition of inscription scripts, and being the 
equivalent script in modern Tamil. The end result of this work will lead to the 
development of a translator that converts the digitized inscription script, 


available in modern Tamil character script, into meaningful text. 
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1.7 RESEARCH METHODOLOGY 


This research describes the design of a Stone Inscription 
Transliteration System (SITS), where the digitally acquired stone inscription 
is taken as an input image to the proposed system. The schematic diagram for 
the SITS is shown in Figure 1.7. Image enhancement techniques are used to 
preprocess the input image and separate characters in the foreground from the 
background without noise, producing only the complete character script. The 
scripts are segmented using two methods, line segmentation and character 
segmentation. To facilitate character recognition, machine learning models 
called pattern recognition are used, alongside the Modified Speeded Up 
Robust Feature with Bag of Grapheme (SURF-BOG), features are extracted 


Camera captured Pre-processing 
stone inscription 
image Linear contrast adjustment 
Input 
inscription 
image | 


database Modified Fuzzy entropy 


based adaptive 
thresholding 


Segmentation 


(ine Chace Iterative Bilateral Filter 
Segmentation 


Segmentation 


Classification by Transliteration by 
Convolutional modern Tamil 
Neural Network Character 


Figure 1.7 Schematic diagram of Stone Inscription Transliteration System 
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and classified using Artificial Neural Network (ANN) and, finally, tested with 
the using Convolutional Neural Network (CNN). From these machine 
learning models CNN outperformed in character recognition, Once the 
characters are detected, they are transliterated into modern Tamil characters 


by mapping them to Unicode. 
1.8 EVALUATION METRICS 


The proposed system is developed to recognize characters from 
recognizing character from stone inscription images. An array of 
preprocessing, segmentation, feature extraction and classification techniques 
are applied for character extraction. A statistical analysis of the image is 
measured using the arithmetic mean, standard deviation (SD), peak signal-to- 


noise ratio (PSNR), precision, recall, and accuracy. 


The arithmetic mean is a basic statistical measure where the 
average of all pixel values is measured by using a sliding m x n window and 
replacing the center pixel in the output image. Equation 1.1 below shows the 
computation of the arithmetic mean 1, where W denotes the width and H 


denotes the height, and the gray value is given by g(x,y). 
1 
Me= Yi bya g(x) (1.1) 


The standard deviation (SD) is a measure of variations that exist in 
the average mean. The standard deviation value, o , is assigned to the center 
pixel value in the output image. It helps measure the level of intensity values 
at the edge pixels of the characters. W denotes the width and H the height, 
and the gray value is given by g(x,y). Equation 1.2 denotes the standard 


deviation. 


o= Yee Ly=1(9%,y) — 1)? (1.2) 
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The peak signal-to-noise ratio (PSNR) measures the difference 
between the original input image, X, and the reconstructed output image, X’. 
To compute the PSNR, the mean squared error (MSE) is computed first, using 
Equations 1.3 and 1.4. The number of rows is represented by M, and their 
corresponding index value by i. The number of columns is represented as N, 


and their corresponding index value by j. 


2552 
PSNR = 10lo9i0 (1.3) 


M,N 2 
Yi ja1 %i,j Xi.) 


M+N 


MSE = (1.4) 

Precision is the ratio of correctly classified characters to that of 
incorrectly classified characters as correct character. It is a measure of how 
many positive predictions are actual positive observations. Equation 1.5 


shows precision. 


True positive 


Precision = (1.5) 


True positive+False positive 
Recall measures the probability of classifying the characters 
correctly. It calculates the ratio of correctly classified characters to the total 


number of characters. Equation 1.6 shows precision. 


True positive 


Recall = (1.6) 


True positive+False negative 

Accuracy is a measure for evaluating classification models, and 
finds the fraction of predictions which concludes that the model has the right 
classification. The model’s performance is predicted by a close analysis of 


positives and negatives. 


True positive+True negative 
Accuracy = 7 (1.7) 


True positive+True negative+False positive+False negative 
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1.9 ORGANIZATION OF THESIS 


Chapter 1 offers an overview of the importance on Tamil stone 
inscriptions. It describes the motivation and objective behind the research, and 
specifies the problem. The complications of the existing method and the 


advantages of the proposed research are outlined. 


Chapter 2 presents a detailed explanation on the evolution of 
inscription characters, the need for the proposed system, and the traditional 
method of copying inscriptions. A detailed literature review on the efforts 
made so far in developing a character recognition system for various types of 


languages is presented. 


Chapter 3 highlights the importance of digital acquisition, and its 
advantages over the existing method. Images are tested with existing 
preprocessing techniques for handling stone inscription images. MFEAT- IBF 


algorithm is proposed to tackle the issues, and offers better results than others. 


Chapter 4 deals with the segmentation of the preprocessed 
binarized script. It handles two types of segmentation, line and character. The 
results obtained from the proposed method are compared with the existing 


method. 


Chapter 5 describes experiments on feature extraction and 


classification, using a pattern matching algorithm. 


Chapter 6 describes experiments on feature extraction and 


classification, using Artificial Neural Network. 


Chapter 7 describes experiments using the CNN for the 
classification and transliteration of ancient inscription characters into 


contemporary Tamil characters. 
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Chapter 8 concludes the thesis and offers suggestion on future 


work. 


1.10 SUMMARY 


This chapter has presented a general introduction on stone 
inscriptions. The need to recognize characters from stone inscriptions has 
been explained. The traditional system followed to date by the Epigraphy 
Branch and its drawbacks have been discussed. In today’s digital era, the 
proposed research offers a new approach and is a primary initiative. The 
overall system design has been introduced in this chapter, along with 
supportive machine learning models and the evaluation metrics that predict 
the results obtained. Finally, an overview of how the thesis is organized is 


presented. 
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CHAPTER 2 


LITERATURE SURVEY 


2.1 INTRODUCTION 


Emerging technologies today in the field of image processing train 
machines to receive input, interpret information, and describe pictorial 
information in text form. Character recognition from images is constantly 
developing and progressing, and has applications in document image 
recognition, scene text recognition, handwritten images. Humans know how 
patterns are modelled and recognized in nature. Hence, an automated system 
that reads characters from an image, thus replicating human reading, reduces 
time and bypasses the need to preserve paper documents. Research on 
machine perception has made possible automatic recognition through 
learning, and enabled a deeper understanding and appreciation of pattern 


recognition systems. 


In the field of character recognition, several methodologies have 
been implemented. For the purpose of this research, around 140 research 
papers were surveyed in the field of character recognition, particularly in 
terms of inscriptions, handwritten images and scanned printed document 
images in different languages. A total of 85 papers in all are discussed in the 
following sections. These include the following sections. These include the 
most recent and the relevant studies in preprocessing (20), segmentation (17), 
pattern matching and feature extraction (27), and classification and 


transliteration (21). 
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2.2 A HISTORY OF CHARACTER RECOGNITION SYSTEM 


In today’s fast-moving world with its incredible technology, 
character recognition is a demanding area of study. In recent decades, the 
most demanding research area has been the design of an OCR system. A 
telegraph code system was developed by Emanuel Goldberg in 1914 that 
could read digits and handwritten characters. Meanwhile a handheld scanner 
was developed by Optophon, a handheld scanner to scan the printed 
document. In continuation, Goldberg designed a handwriting recognition 
system and also proposed the template matching system. In 1933, Paul W. 
Handel filed a US patent on template matching methods for recognizing 
handwritten documents. In 1985, the structural properties of the characters 
were taken into consideration and single characters subdivided into multiple 
portions to identify patterns. In 1990, new and powerful pattern recognition 
systems with image processing techniques began to make their presence felt. 
In 1994, an OCR system was developed by (Radio Corporation of America) 
RCA engineers to help the blind. Methods like the Hidden Markov Model 
(HMM), ANN, and fuzzy systems were used to recognize handwritten 


documents. As technology advanced, numerous imaging techniques came into 


existence. 
2.3 BACKGROUND STUDY 
2.3.1 Inscriptions 


Inscriptions are the earliest form of writing. Jeniffer et al. (2014), 
state that material remains such as inscriptions have defined the culture of 
human beings across the centuries. Inscriptions are keys that help unfold 
masses of information on early human life and trace the survival of humans in 
former societies. The earliest traces of epigraphy in the Indian subcontinent 


are the undeciphered inscriptions of the Indus Valley Civilization (Indus 
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script), which date back to the early 3 millennium BC. Two major 
archaeological classes of symbols are those found from the 1“ millennium 
BCE, megalithic graffiti symbols and symbols on punch-marked coins, 
though most scholars do not consider these to constitute fully linguistic 
scripts, and their semiotic functions are not well understood. The earliest 
deciphered epigraphic inscriptions of significant length are the Edicts of 
Ashoka of the 3" century BCE, written in a form of Prakrit in the Brahmi 
script. Jain inscriptions in South India written in Tamil-Brahmi, the 
Bhattiprolu alphabet and the Kadamba alphabet are also dated relatively early. 
Indian epigraphy became more widespread over the 1‘ millennium, engraved 
on the faces of cliffs, pillars, tablets of stone, and drawn in caves and on 
rocks, with some gouged into the bedrock (Das et al., 2015). Later they were 


also inscribed on palm leaves, coins, Indian copper plates, and temple walls. 


The Epigraphy Wing was started by the Department of 
Archaeology, Government of Tamil Nadu, in 1966. The role of this wing is to 
copy and study inscriptions, and transliterate the information manually into 


contemporary language format. 
2.3.2 Estampaging 


The Department of Archaeology acquires inscriptions by a method 
called estampaging, where inscriptions are replicated on paper as they were 
on stone. The procedure for estampaging is described by Aswatha et al. 
(2014). Estampaging is carried out by tapping wet paper on to the inscribed 
stone. Indian ink is poured, or coal dusted, over the stone. A sample of 
estampages acquired by the Department of Archaeology is shown in Figure 


2.1, 
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to Cog Ss Se r 
a re An as 
FFITES HE z SDE IS SUAS Ae o 
PPPS Pg GHOSE TAs 


4 babes sss! Poet ong iS ca 


Figure 2.1. A Sample of the estampage collected from Department of 


Archaeology 


The series of distinct stages are listed below: 


e The surface of the stone is thoroughly cleaned with a wet brush. 


e Layers of thick paper, specially designed for estampaging, are 


moistened and pasted on the surface of the stone. 


e The wet paper is pressed down over the stone so it sticks to the 


engraved characters. 
e Black ink is usually over the wet paper. 


e Once the paper is dry, it is peeled from the stone. The characters 
are replicated into the paper in white, against a black 


background. 


The process described above takes a long time (several months) and 
is completely weather-driven. Post-processing includes analyzing the contents 
of the estampages and transliterating them into contemporary language form. 
The traditional system of inscription acquisition is shown in Figure 2.2. The 
inscription is acquired through estampaging, scanned for digital preservation 
and, finally, stored in the form of documents. In this process, not every 


estampage is scanned and preserved. Only a small number of estampages are 
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scanned. The rest are stored in paper format and therefore suffer the 


consequences of poor storage and deterioration. 


2 . =, , Sere: ae Se Ss 
SEO STS SAO a5 FOSS 
x EDR Se SE RS deere et 


Sat seat Sia ok eet 2 Beers Fas 
sere oS 


Storage in document format 


Figure 2.2 The existing traditional system 


203.3 Century-wise inscription evolution 


Each country is familiar with its own linguistic, kingdom and 
cultural heritage. India has ample information on its ancient history, culture, 
administration, medicine and health care, made available through inscriptions 
on stone and palm leaves. Figure 2.3 shows the evolution of vowels and 
consonants of Tamil characters over various centuries. Pillai et al. (1956) 
briefly described the evolution. The first writing system, termed the Brahmi 
script, made its appearance during the reign of Emperor Ashoka, and came to 
be known as the eponymous Ashokan Brahmi. A parallel script that 
developed simultaneously was the Sinhala Brahmi, dating back to the 3rd 
century BCE. The Brahmi script led to the development of the southern 


(Devanagari, Oriya, Bengali, Gurmukhi and Gujarati) and northern scripts 
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(Telugu, Kannada, Tamil and Malayalam). The Vattaezhuttu script had its 
origins in the 5" century AD and flourished until the 7" century AD, during 
the Pallava era. It slowly evolved into the modern Tamil alphabet from 8" 


century AD and became standardized in the 19" century. 


Most ancient historical inscriptions and other forms of textual data 
were written in Brahmi. Brahmi alphabet serves as the base script for a range 
of south Asian languages, including Tamil. Modern Indian scripts and others 
found in Southeast and East Asia are derived. Brahmi is known as a “syllabic 
alphabet”, implying that each sign is a simple consonant, or a syllable a 


consonant and an inherent vowel. 


Figure 2.3 Century-wise development of Tamil Characters (tnarch.gov.in) 
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Matras, which correspond to the same consonant with different 
vowel with extra strokes, are used to write languages such as Sanskrit, Tamil 


and Prakrit. 


Figure 2.4 shows the development of the character ‘ka’ from the 
basic Brahmi character over the centuries, from which modern Tamil and 
Devanagari characters are derived, according to the Tamil heritage 
foundation. It shows that from the 5" to 6" centuries, Brahmi characters led to 
the development of characters in Kannada, Grantham and Telugu. Compared 
to other centuries, a majority of the inscriptions were inscribed during the 11" 


century, as shown in Figure 2.5. 
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Figure 2.4 Development of the letter ka (&) in Devanagari, Tamil and 
other South Indian Scripts (Tamil Heritage Foundation) 
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Between 3“ BC to 5" AD, around 100 Brahmi inscriptions were 
engraved on potsherds, rock beds and caves. From the 6" to 9" centuries, 
around 900 vatteluttu and Grantham inscriptions were engraved. From the 
10" to 13" centuries around 19000 ancient Tamil character inscriptions were 
engraved on stone in temple walls and on monuments. Between the 14" and 
16" centuries, around 6000 inscriptions were engraved, but the number 
dropped to 2000 from the 17" to 19" centuries. The rapid deceleration makes 
it clear that the period between the 10" to 13" centuries had a wealth of 


inscriptions, many of which are yet to be acquired and transliterated. 
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Figure 2.5 Century-wise evolution of the volume of inscriptions 


2.3.4 Features of the Tamil language model 


Tamil is a Dravidian language predominantly used in the south Indian 
states of Tamil Nadu, Puducherry, Kerala, Andhra Pradesh, Karnataka and 


Telangana, as well as in Asian countries such as Sri Lanka, Singapore, 
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Malaysia. Tamil is considered to be one of the oldest surviving classical 
languages, wherein characters are divided into three categories: vowels, 


consonants and compounds. 


Vowels are termed ‘soul’ letters and consonants ‘body’ letters. A 
combination of the two results in compound syllabic characters. 
Contemporary Tamil has 247-letter alphabet, comprising 13 vowels (91 94 @) 
FF 2 9a 6T 6] 9 Ep GP Epa ..) and 18 consonants (& hl F @ L oo H H UL Wi ev oI 
LD et Mm oot). The remaining letters that constitute the alphabet are a combination of 
consonantal vowels called compound characters. Table 2.1 and 2.2 show the 
pronunciation of vowels and consonants, as well as their 15919 International 
Organization for Standardization (ISO) representations and was published in 


2001. 


Table 2.1 The alphabet - vowel 


Vowels Pronunciation ISO 15919 
a a 
A aa a 
Q) i i 
Fr li i 
De u u 
oar uu U 
61 e e€ 
J ee e 
ro al al 
& O O 
6&9 oO e) 
een au au 
ak Special character 
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Table 2.2 The alphabet - consonants 


Consonants Pronunciation ISO 15919 
& Ka k 
Al nGa n 
& Sa c 
@ Gna fl 
L Ta t 

OO Na n 
5 Tha t 
1) Nha n 
u Pa Pp 
Lo Ma m 
Ww Ya y 
J ra r 
60 la ] 
6 Va Vv 
Lp Za 1 
ert La l 
9) Ra t 
60T Pa n 


The features of Tamil characters are given below: 


24 


e The characters constitute a syllabic alphabet writing system, 


written horizontally from the left to the right. 
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e Vowels are always written as independent letters. 


e Since the letters contains curved strokes, they are ideally suited 


for writing on palm leaves without damage to the leaves. 


e Strokes added to the consonants produce additional compound 


characters. 
2.4 LITERATURE REVIEW 
2.4.1 Pre-processing 


Pre-processing suppresses distortions, enhances image features, and 
improves the quality of the image. There are several preprocessing techniques 
in use of which thresholding plays a major role in extracting the foreground 
information from an image and eliminating the background. Preprocessing 
techniques applied to document images, handwritten images and inscription 


images are as follows. 


Image pre-processing plays a vital role in enhancing the accuracy 
of character recognition. An overview of image enhancement techniques, 
described by Alginahi (2010), explains the foundations of processes like point 
processing, noise removal with filters, histogram processing, global and local 


thresholding, segmentation and morphological operations. 


Chaki et al. (2014) discussed the underlying principles in 
binarization techniques to compute the threshold value, and described five 
major techniques for binarizing images: finding the entropy, clustering, 
calculating image variance, error measure from the optimal threshold, and 


contrast variations in images. 


Das et al. (2015) implemented Fast, Independent Component 
Analysis (FICA) for binarizing characters from Kannada stone inscription 


images with highly correlated noises. The linear regression method is applied 
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to normalize the image. The global threshold is applied to binarize the image 
by finding the cumulative residual entropy, which provided better results than 


the Otsu method. 


Jayanthi et al. (2014) proposed a blind source extraction method for 
extracting inscription characters from the background. From the source signal, 
each independent component is considered, and the contrast maximised by 
higher order cumulants. The proposed method increased the readability of the 


text when compared with the ICA and Fast ICA. 


Lee (2016) proposed improved adaptive binarization using fuzzy 
logic. Image regions are divided into two: ambiguous and obvious. Obvious 
regions are predicted by applying global binarization, and ambiguous ones by 
means of local binarization using fuzzy logic. The triangular membership 
function is applied. Here, the alpha-cut value is set at 0.5. The obvious region 
is predicted when the alpha is lower, and the ambiguous one when the alpha- 


cut value is higher. 


Soumya et al. (2014) attempted to binarize degraded Kannada 
epigraphical images. The work is carried out in four phases. Firstly, degraded 
images are enhanced using different types of filters. Secondly, the images are 
smoothened using the Gaussian blur algorithm. Thirdly, a mask of the original 
images is created using unsharp masking. Finally, a Laplacian filter is used to 
perform spatial operations on the pixels. To binarize the images, the Otsu 
thresholding technique is used to find the optimal thresholding. The process 


above produced good results, but very few datasets were handled. 


Raj (2012) handled Tamil handwritten document images. A slew of 
basic image preprocessing techniques like binarization, noise reduction, 


normalization, skew correction, thinning, and slant removal are applied. 
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Evidently, preprocessing plays a major role in attaining good recognition 


rates. 


The central role of preprocessing ancient Tamil characters in 
Bhuvaneswari et al. (2014) is the preparation of the acquired image for 
recognition through background subtraction, noise removal, image glazing, 
and thinning. The acquired image is fed into the initially digitized image 
processing software. The first step is subtracting the background from the 
acquired image. The image’s regions of interest here are the Tamil characters 
in its foreground. The inscriptions are on stone, which forms the background. 
Stone is a rough surface during processing and digitalizing. Image glazing 
refines the skeleton formed by iterative thinning. Image glazing involves the 
following steps: (i) increasing the regions of rough edges, and (ii) enhancing 
the boundaries of the image. In a thinning operation, the size of the binary 
image is reduced to lines that approximate the middle skeletons of the 


character. 


Chandrakala et al. (2017) proposed a retinex method to enhance 
digitized estampages of Kannada inscriptions. The period of inscriptions 
covered in this work is the 11" century. Using the retinex method, the quality 
of the image is enhanced by highlighting foreground characters even in 
deteriorated estampage images. Finally, the digitized quality of the 


estampages is improved before they are stored in the database. 


Kavallieratou et al. (2006) developed a hybrid approach that 
combines global and local thresholding. Adaptive binarization is done 
iteratively. To separate the foreground pixels, the average value of the pixel is 
computed and subtracted from the neighborhood pixel values. Based on the 
noises present iteratively, thresholding is applied to extract the foreground 


accurately. 
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Singh et al. (2012) proposed a new local adaptive thresholding 
method for binarizing document images. In this work, the local mean value is 
calculated based on the window size, and the mean deviation is calculated 
based on integral sum value. This proposed method speeded up the process, 


compared to basic thresholding techniques like the global and local. 


Zhou et al. (2009) proposed an improved adaptive binarization 
algorithm using four steps: The Wiener filter is used for de-noising, and the 
background pixels are identified via interpolation. The Laplacian Gauss 
algorithm is applied for binarization. Finally, the background intensity and 
original image difference are computed by means of pixel intensity values. 
The process is applied on images with uneven illumination and good results 


are obtained by retaining the edges of the characters. 


Gatos et al. (2006) handled degraded document images by 
proposing a new adaptive binarization technique. Here, the degradation 
handled included uneven light illumination, contrast variations and noise 
corruption. The document image is pre-processed using the Wiener filter and 
the foreground characters are extracted by interpolation. In the post- 
processing, the strokes of the character are retained to ensure image quality. 
Nguyen et al. (2016) proposed a morphological method for segmenting text 
lines in Japanese handwritten script. The width of the strokes is measured 
using the Stroke Width Transform (SWT). The proposed method segments 


text lines and curves that touch. 


Document image binarization was performed by Lu et al. (2010) by 
through finding stroke edges. Bataineh ef al. (2017) is proposed adaptive 
binarization for degraded document images, that involves surface contrast 
variations. Here, the variance of the pixels is measured to find contrast 
variations between pixel. Geometric feature extraction is executed by 


generating geometric and pointer images. The experimental evaluation was 
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performed on statistical measures to produce better results, compared to other 


thresholding techniques. 


Devi. et al. (2006) implemented a thresholding technique for 
differentiating the foreground character from the background. Adaptive 
thresholding is done, based on the Quadratic Integral Ratio (QIR) algorithm 
that helps separate dark pixels from the foreground. Global thresholding is 
carried out using the Otsu algorithm which produced results only for good 
images, while adding noise to the rest. Comparatively speaking, the QIR 
method does a better job of thresholding than the Otsu. 


An algorithm for multilevel thresholding is proposed in Liao et al. 
(2001), that is faster than the Otsu. The algorithm works by maximizing the 
between-class variance, and the values are stored in the look-up table and 
recursive algorithm to find the optimal threshold value. Hence, the proposed 


method reduces the time complexity. 


Foreground-background separation in low-quality color document 
images was implemented by Utpal Garain et al. (2005), using the adaptive 
method. The connected component labelling algorithm and dominant 
background component differentiate between the foreground and background 


of the input image. 


Adaptive document image binarization was proposed in Su et al. 
(2013). Here, the contrast and gradient of the image cause variations between 
the text and background. The adaptive contrast map combined with the Canny 
edge identifies text stroke edges. Followed by the local threshold, the 
foreground text is extracted. The proposed method was tested on Document 
Image Binarization Contest (DIBCO) datasets and achieved higher accuracy, 


compared to other techniques. 
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2.4.2 Segmentation 


Proper segmentation is necessary to recognize a character. The goal 
of segmentation in the field of character recognition is to segment each 
character automatically so it can be dispatched to the next level of processing. 
Character strokes are crucial, given that missing a single stroke may alter the 
meaning of a word. Hence, segmentation must be carried out perfectly to 
retain the features of characters as such. What follows below is a survey of 


the segmentation techniques developed in the field of character recognition. 


Pannirselvam et al. (2014) proposed a segmentation algorithm for 
handwritten Tamil document images, based on the horizontal and vertical 
projection profile method. This method, tried on various document images, 


produced efficient results with a high segmentation rate. 


Sridevi et al. (2012) implemented a segmentation algorithm on 
ancient Tamil script document images. Segmentation is critical task when 
characters overlap. Hence, the Particle Swarm Optimization (PSO) is applied 
to segment lines from the script. A combination of connected components and 
the nearest neighborhood helped segment the characters and achieve good 


results. 


Nabil Aouadi et al. (2017) proposed a novel method for text line 
segmentation, chiefly concerning components from Arabic manuscripts. 
Segmentation is done in two steps: (1) the localized text component is found, 
based on the interpolation and shape of the character, and (2) based on a 
similar model stored in the dictionary, the central point of the text component 
is found. This method retains the skew of the character and the output is 
writer-independent. Dave et al. (2015) explained the levels of segmentation 


and baseline character extraction, followed by word segmentation through 
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finding the shortest path via the pixel-counting approach. The pixel-counting 


method outperformed other techniques. 


Roy et al. (2016) proposed a novel approach called zone 
segmentation for Indic scripts (Bangla and Devanagari). In this approach, the 
script is subdivided into three different zones - upper, lower and middle - to 
reduce the number of distinct component classes. The zone segmentation 
technique enhanced the performance of the recognition rate using the HMM 


classifier. 


Ryu et al. (2015) proposed a binary quadratic assignment technique 
to find gaps between individual words in handwritten document images. The 
irregularity in handwritten characters is trained using a Support Vector 
Machine (SVM) through structural learning. The proposed method was 
attempted on Latin and several Indian languages and produced state-of-the-art 


performances. 


Mangla eft al. (2014) developed a segmentation algorithm to 
segment overlapping and broken characters in Punjabi text, also known as the 
Gurmukhi script. Once the images are preprocessed, the header lines are 
removed. Using the vertical projection profile method, the characters are 
segmented horizontally. Additionally, broken characters are segmented by 
considering the neighboring pixels. Hence, the system works even for skewed 


and overlapping characters. 


Mei et al. (2013) worked on documents printed in Chinese 
character as input. Segmentation is done by finding the vertical projection 
profile from which the connected regions are identified. The text line is 
segmented by finding the upper and lower borders, and counting the number 


of black pixels in every column. To merge the connected components, the 
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height-width ratios are computed. Higher segmentation accuracy is achieved 


with the process above. 


Tan et al. (2012) proposed a nonlinear clustering method for 
segmenting handwritten characters. First, the complete text is divided into 
lines based on the strokes and the similarity is computed, based on the stroke 
gravity. From the similarity matrix index, cluster labels are obtained. Based 
on these labels, the strokes of the characters are combined. Two types of 
clustering methods, spectral and kernel, are proposed in this work. The 
proposed method was tested on four types of datasets and produced effective 


results. 


Saba et al. (2011) proposed an ANN approach for segmenting 
cursive handwritten words, tested on the Cedar benchmark dataset. The 
geometric features of the characters are taken to find their boundaries, the 
drawback being that a few characters end up getting oversegmented. The 
results produced by this approach were highly promising in terms of accuracy 


and reduced time complexity greatly. 


Kurniawan ef al. (2011) proposed an intelligent technique for 
segmenting overlapping handwritten words. To find overlapping portions, the 
feature vectors of the said portions are clustered and segmentation performed 


using a Self-Organizing feature Maps (SOM). 


By using SOM, the core zone of the character is identified and the 
computation cost reduced. Saeed eft al. (2009) proposed a segmentation 
algorithm based on a region growing technique for handwritten English and 
Arabic document images. The proposed reduced the work of skeletonization 


and saved time. 
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Garg et al. (2010) handled handwritten Hindi characters. 
Segmentation is done by decomposing the script into lines, word and 
characters. The first step is to locate the header line by considering row 
pixels. Words and characters are segmented by the vertical projection profile 


method. The accuracy achieved in segmenting Hindi character was high. 


Louloudis et al. (2009) handled historical handwritten documents 
where text line segmentation is done using the Hough transform, based on the 
connected components. Correctly detected text lines are identified using 
skeletonization. Word segmentation is done by finding the distance between 
the connected components using Gaussian mixture modelling. The efficiency 
of the algorithm was evaluated by comparing two datasets, the ICDAR 2007 


and a historical document. 


Murthy ef al. (2004) proposed a novel approach called the nearest 
neighbour classifier to segment an epigraphical script image. The problem 
handled in this work is the limited space difference between text lines and 
characters. The spacing is also not uniform. The proposed method helps 


segment characters from Brahmi epigraphical scripts. 


Manmatha et al. (2005) proposed a novel automated segmentation 
method to segment words using a scale-space algorithm for handwritten 
historical manuscript images. To find lines in the image, a gray-level 
projection profile is applied, followed by filtering the lines using the 
anisotropic Laplacian in an iteration. In the scale-space algorithm, the 


selection of scales is vital to recover words and achieve a reduced error rate. 


Zheng et al. (2004) proposed a technique for machine-printed 
Arabic documents. Two characteristics, known as structural and Isolated 
character detection, are vertical and horizontal cross points. Since the images 


have continuous words, the work is done in three parts and includes 
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segmenting lines, words, and sub-characters using a vertical histogram. 
Elgammal et al. (2001) developed a graph-based method for Arabic text 
segmentation. The structural features of the text are considered for graph- 


based segmentation. 
2.4.3 Feature Extraction 


Feature extraction helps detect desired portions (features) from a 
digital image. Identifying feature pixels from characters plays a critical role in 
achieving good recognition results. A set of features represents a feature 
vector. Feature extraction discovers the attributes and properties of the 
character. Equally, pattern recognition also identifies irregularities and shapes 
in image pixels. Below is a survey of feature extraction and pattern matching 


techniques in the field of character recognition. 


Arif Mohammed ef al. (2015) presented an overview of feature 
selection and extraction for handwritten characters. Statistical feature-based 
techniques like zoning, crossing and projection are discussed, along with 
structural features like counting topological structures and measuring 


geometric features, graphs and trees. 


Mohana et al. (2015) developed a Kannada character recognition 
system for stone inscription images. Inscriptions are captured using a normal 
camera and the image is pre-processed to enhance the quality. The characters 
are segmented and feature extraction done. Structural, statistical and global 
features are extracted using entropy and intensity measures, and classified 


using an SVM. 


Bhuvaneswari ef al. (2015) developed a recognition system for 
stone inscription. Here, the features of characters are extracted based on a 


structural feature called an image and a regional feature called a zone. The 
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features of the image are extracted by considering the position of the character 
pixels. The first pixel position, as well as the average distance between pixels, 
is computed using the Euclidean distance. Classification and recognition are 
done using the nearest neighbour classifier. The combined structural and 
regional feature extraction methods achieved a better recognition rate than 


existing methods. 


Rajkumar et al. (2012) proposed a novel slant correction method 
for ancient stone inscriptions. Quantile feature points and a horizontal density 
histogram are used to find the core region of the inscription image. Based on 
the threshold value, the upper line, middle line and lower line are predicted. 
The uniform slant angle is estimated by finding the maxima and minima of 
the character, connected by a straight line. With slant estimation, the 
characters are trained using the HMM to produce an increased accuracy rate 


and reduced time complexity. 


Rajakumar et al. (2012) proposed an artificial immune system for 
recognizing characters from stone inscription images. The system is designed 
to learn characters by means of a self-adaptive and developing immune 
memory as it relates to the features of the character. Curve features are 
extracted in the time domain and line features in the frequency domain using 
the sliding Hamming window. The method also helps find abnormal patterns. 
The proposed method, when compared with the neural network classifier, was 


faster and achieved higher accuracy. 


Janani et al. (2016) developed a Tamil inscription recognition 
system. Initially, segmentation is done by considering the connected 
component as well as__ structural features such as lines, slopes, skewness and 
circles. Using two techniques known as bilinear interpolation and linear 
interpolation, template matching is carried out with the equivalent modern 


Tamil alphabet. The method helps detect ancient characters and _ their 
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equivalent modern characters. Wahi ef al. (2015) proposed a pattern 
recognition system for handwritten Tamil characters. The major contribution 


of this work is the extraction of features using Zernike moments. 


Rajithkumar ef al. (2014) developed a system for recognizing 
characters from Kannada stone inscriptions. Images are captured using a 
normal 16-megapixel mobile camera. Key points are extracted with the Scale- 
Invariant Feature Transform (SIFT) and an image mosaic obtained. A cross- 
correlation is executed to discover the template of the characters. This is done 
by finding key points for the continuity of the characters through computing 
the Euclidean distance. Based on the high and low correlation values, the 


system recognizes the characters. 


Mohana ef al. (2014) implemented the SIFT algorithm for 
extracting characters from Kannada stone inscription images. The features of 
each character are extracted by considering the scale space, key point 
localization, descriptor and orientation, and stored in the database. Finally, the 


character is matched, based on the template stored in the database. 


Siddiqui et al. (2018) developed a handwritten script identification 
system by extracting features using the Discrete Cosine Transform (DCT), in 
combination with the Discrete Wavelet Transform (DWT). The proposed 
method produced good results in identifying eight types of scripts. 


Chaowicharat et al. (2016) proposed a novel feature extraction 
method called a direction histogram and a bag-of-words histogram for a Thai 
handwritten dataset. The direction histogram is designed to handle 
miscellaneous writing styles while the bag-of-words histogram tolerates 
variations in curves and thickness. The proposed method was implemented 
for 52 datasets of 80 Thai characters each, and showed a good recognition 


rate, even for distorted handwritten datasets. 
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Aggarwal et al. (2015) developed a recognition system for 
handwritten Gurmukhi characters and numerals using the gradient feature 
extraction technique. Feature gradients are extracted in one of two ways: one 
subdivides a character and numeral into blocks called sub images. The second 
concatenates the obtained gradients into feature vectors. The recognition rate 
increased for the detection of both Gurmukhi characters and numerals. Their 
future work focuses on an extension that recognizes both characters and 


numerals in a combined dataset. 


Yao et al. (2016) proposed a novel circle detection method using 
the curvature-aided Hough transform which estimates the radius of a circle 
from the curvature. Curvature estimation helps reduce the accumulating 
points and interruptions among varying scales. Such a process gets the system 
working faster to detects circle. The algorithm is tested on traffic sign images 
and compared with the conventional Hough transform. The proposed 


curvature-based method was found to be less time-consuming. 


Fujisawa et al. (2003) used a directional pattern matching 
technique for recognizing kanji characters in both machine-printed and 
handwritten data- sets. Based on the local stroke components in the character, 
directional planes are generated. For direction feature extraction, the 
frequency response is analyzed using a gradient mask. Based on similarity 
properties like continuity, stroke direction, thickness, and stroke angles, the 


direction position improved the performance of character recognition. 


Prasad et al. (2009) proposed a novel solution for recognizing 
handwritten Gujarati characters. Here, the feedforward neural network is used 
with the backpropagation learning algorithm. Images are subdivided into 
templates and a linear cross-correlation is performed to find similar patterns 
between the already-trained and input testing images. The results were 


validated using a series of statistical tests called template classification cross- 
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correlation coefficient analysis. The recognition rate improved even for 


similar characters. 


Schmid et al. (2000) introduced two assessment criteria for interest 
point detectors called a repeatability rate to assess geometric stability under 
assorted transformations, and information content to deal with the 
individuality of the features involved. Based on the two criteria above, 
contour-based, intensity-based and parametric model-based interest point 


detectors were compared. 


Liu et al. (2001) proposed a structural feature matching approach 
for recognizing handwritten Chinese characters. Structural matching helps 
predict strokes for structural interpretation, and is performed by developing a 
model for each reference character through an attributed relational graph. 
Strokes and inner strokes are identified by matching them with the character 
referenced. Strokes, coupled with consistent matching, help recognize 
characters in the KAIST image database. The method demonstrated promising 


results. 


Leutenegger et al. (2011) proposed a novel method called the 
Binary Robust-Invariant Scalable Key point (BRISK) that has three stages: 
key point detection, description, and pattern matching. Key point detection is 
performed on a scale space by constructing n octaves, followed by sampling 
and rotation estimation. Descriptor matching among the selected key points is 
computed by finding the Hamming distance. Based on the brightness of the 
image, circular patterns are formed. These detect key features that help in 


faster computation and utilize limited computation power. 


Fischer et al. (2012) developed a word spotting system for scanned 
handwritten document images, based on the HMM. The inputs to this system 
are text lines and keyword strings. Images are normalized to render them 


adaptable to different writing styles and local features are extracted using a 
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sliding window. A score is then assigned to the lines of text extracted, based 
on the keyword. Based on the threshold value, the keyword can be spotted. 
The system is trained using the HMM that is mapped with the keyword text 
line which identifies the keywords present in the scripts. 


Mohammad et al. (2014) implemented an OCR algorithm for 
handwritten documents to convert them into an interpretable form. The 
system works in four stages: (i) preprocessing by grayscale conversion, (ii) 
feature extraction by scanning text lines from the upper-left to the lower- right 
comer, (iii) recognition by pattern matching, and iv) generating a binary 
format of the template image. The algorithm is trained for the same, and 


consequently does not work for font variations and bad handwriting. 


Rusol et al. (2015) developed a technique for word spotting without 
segmentation using the Query By Example (QBE) paradigm for historical 
documents. Local patches are formed by constructing a bag-of-visual-words. 
A latent semantic analysis is done on the topic space and the descriptors are 
compressed using the quantization method, which efficiently indexes 
documents in terms of memory and time. The technique is applied on both 
handwritten and typewritten documents and achieved a good performance 


rate, compared to the existing character spotting method. 


Kan et al. (2002) classified alphanumeric characters using 
orthogonal moments. Local information about the character is identified by 
the Orthogonal Fourier-Mellin moments (OFM) using radial polynomials. 
The algorithm works for small-sized character datasets and is less sensitive to 
noise and character variations. The proposed algorithm is compared with the 
Zernike moments, which works only for large-sized character databases. The 
OFM with circle-spaced centroid bounding causes reduced distortion. Hence 
the OFFM performed better overall, compared to the Zernike moments with 


FM (Fourier Moments) scaling. 
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Nawaz et al. (2009) developed an offline OCR system for isolated 
Urdu characters of different fonts using a pattern matching algorithm. The 
input image is scanned and a chain code created in the form of binary images 
termed 0’s and 1’s. A database in an xml file is created for all the letters of the 
Urdu alphabets as a parent-class node. Diacritics are removed using a special 
filter called the Tuay filter, along with morphological techniques. The testing 
character is matched with the chain codes of the trained characters that are 


calculated. 


Kacem eft al. (2017) proposed a texture-based approach to 
differentiate between Arabic and Latin scripts, based on the writing 
orientation and length of the stroke. Structural features are identified by 
obtaining a histogram from a black-and-white image that is subject to the run- 
length encoding method. The K-nearest neighbor classifier is used to find the 
differences between the machine-printed and handwritten scripts for both 
languages. The system also works well for different font styles and words. 
However, the method worked only for images of good quality and achieved a 


good recognition rate. 


Cheung et al. (1998) proposed deformable models to identify 
patterns with shape variations. The model, combined with the Bayesian 
framework, helps match and recognize patterns for handwritten digits. The 
model was tested on the NIST SD-1 dataset and handled 11,791 test samples. 
Iterative deformable matching helps faster implementation on sequential 
computers. Input samples suffer from information redundancy, which is 
resolved by using subsampling techniques. This model provides fast matching 


and achieved a high recognition rate, even with shape variations. 


Belongie et al. (2002) proposed a novel approach to find the 


similarity measurement between shapes and exploit it in object recognition. 
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Finding two points with a connection in two different objects, and estimating 
the transformation between those points gives the similarity measurement. 
The selected reference point finds the overall connection between other points 
and provides a global discriminative characterization. Similarly, dissimilarity 
is computed by finding the error between the connection points and the 


nearest neighbor classifier used to recognize the objects. 


Bharath et al. (2012) developed an online character recognition 
system based on the HMM, using a lexicon and non-lexicon driven technique 
with a bag-of-symbols illustration for handwritten Devanagari and Tamil 
scripts. Here, the proposed system is completely script independent and 
accommodates different writing styles. Features are extracted by considering 
the direction of strokes, curves, slope and linear lines, and recognizing words 


by matching them appropriately. 


Del Bimbo et al. (1994) developed an elastic pattern matching 
technique for recognizing digit models. Templates are derived for each digit 
model by performing elastic deformation. Pattern-matching is undertaken 
with two requirements to be met: finding a possible match for the image and 
reducing elastic deformation. The process offers an optimal deformation 


solution in terms of matching and recognizing digits with derived templates. 
2.4.4 Classification and transliteration 


The numerical properties obtained from the features of the image, 
and segregating data into categories is called classification. The data is 
classified into certain classes, and is understood by the computer through a 
training process. Transliteration is the process of converting data from one 


language to another and, specifically, into a recent and understandable format. 
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Karunarathne et al. (2017) developed an OCR system for Sinhala 
inscriptions. Following preprocessing and feature extraction, template- 
matching is done to compare pixel values. Characters are classified using the 
ANN and backpropagation algorithm, and implemented with the CNN, which 
produced higher accuracy than the ANN. 


Beulah et al. (2015) developed an efficient system for translating 
Tamil Grantham characters into modern Tamil characters. First, images are 
binarized and characters segmented using the PSO. To construct a clear 
image, the Probabilistic Neural Network (PNN) is applied to help reduce 
noise. Finally, the characters are transliterated by constructing a Grantham 


font and using a mapping technique. 


Kunchukuttan et al. (2015) developed an online system called 
Brahmi-Net that translates and converts a script into other Indian languages. 
Parallel and statistical transliteration are performed by creating a mined 
corpus. A large number of languages gas been transliterated by considering 
the scriptural similarities as a key ingredient. The system helps transliteration 
in 10 Indian languages. Elakkiya et al. (2017) developed a character 
recognition system for handwritten Tamil script images. Here, the K-Nearest 
Neighbour (KNN) classifier is used to transliterate Tamil scripts into an 


editable format. The algorithm increased both accuracy and speed. 


Ruwanmini et al. (2016) handled Sinhala stone inscription images. 
They developed an OCR system to transliterate inscriptions into modern 
Sinhala characters, and track inscription sites. Images are preprocessed, 


features extracted, and characters classified using the K-means algorithm. 


Vellingiriraj et al. (2016) handled Tamil Brahmi and Vattaezhuttu 


characters. He proposed a zoning-based algorithm for classification, and 
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compared the translation results of both types of characters mentioned above. 


The Brahmi translation demonstrated higher accuracy. 


Mahalakshmi ef al. (2013) transliterated Tamil Brahmi inscriptions 
into 21“ century characters using Laboratory Virtual Instrument Engineering 
Workbench (LabVIEW). Various optimization techniques are implemented 
for segmentation and the counter let transform used to enhance the ancient 
script. The green channel present on the character is identified and histogram 
matching done in the enhancement module. Brahmi characters are 


transliterated using LabVIEW. 


Al-Farjat et al. (2012) developed a system for transliterating text 
from one language to another. The system uses the Unicode chart for the 
character dataset. The codes are subdivided into 128-character slots, assigned 


to each language, and used to identify the target language script. 


Mari et al. (2015) proposed a view-based feature extraction 
method for recognizing handwritten Tamil characters. Here, five major views 
of a character are considered and divided equally into 16 zones, from which 
80 feature values are extracted and classified using the SVM and Multilayer 


Perceptron (MLP) classifiers. Accuracy of 95.2% was achieved. 


Choudhary et al. (2013) proposed a multi-layered feedforward 
neural network for the classification of handwritten English character images. 
The difference between the desired and actual output is calculated for each 
cycle, and the weights adjusted during error backpropagation. The process 
continues till the network converges to the allowable or acceptable error. This 
method involves the backpropagation learning rule, based on the principle of 
gradient descent along the error surface in a negative direction. Promising 


results were achieved when binarization features and the multilayer 
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feedforward neural network classifier are used to recognize offline cursive 


handwritten characters. 


Kavitha et al. (2013) developed a multiple script recognition system 
which classifies the English form of the Indus script. Two approaches have 
been proposed: skewness-based and nearest neighbor-based. Skewness is 
measured with respect to the x-axis, and the nearest neighbor computes the 


modifiers present in both the Indus and English scripts. 


Manigandan ef al. (2017) proposed natural language processing 
techniques for recognizing Tamil characters from the 9"" to 11" centuries. The 
SIFT algorithm is used to extract feature vectors from the character, followed 
by training with an SVM classifier. The patterns of each character will be 
matched with the data stored earlier and mapped to the corresponding 


Unicode, which replaces modern Tamil characters effectively. 


Shanthi et al. (2010) developed a recognition system for classifying 
Tamil handwritten scanned images using an SVM classifier. Seetha Lakshmi 
et al. (2005) handled printed Tamil documents. Images are preprocessed for 
contrast enhancement and skew correction, followed by segmentation to 
predict the interline spacing between texts. The structural features were 
identified, based on this features the characters were classified using SVM. 
Finally, the characters are transliterated by matching them with the Unicode 


of Tamil characters. 


Meng et al. (2018) implemented deep learning techniques for 
recognizing ancient Asian characters so as to preserve and digitize them. 
Tamen et al. (2017) implemented a multiple classifier to recognize characters 
from Arabic handwritten words. Statistical contour features are measured 
from each character. An integrated multiple classifier, comprising the SVM, 


MLP and Extreme Learning Machine (ELM) with decision levels, is 
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constructed. The IFN/ENIT dataset was evaluated with these integrated 


multiple classifiers and achieved a good recognition rate. 


Das et al. (2015) handled the Japanese hiragana character by 
finding its geometric topology. Conjunction points that correspond to closed 
loops and strokes are identified. With these points, the template image is 
created, matched with the input testing character, and the character is 


recognized. The accuracy obtained was 94.1%. 


Chacko et al. (2015) used the feedforward neural network for 
Malayalam handwritten characters. Character features are extracted, based on 
the gradient and density. Four types of rules are checked for combining the 
results obtained from individual classifiers. The max rule selects the 
maximum confidence value from the output, and the sum rule sums up all the 
class values and selects the highest. The product rule gives the maximum 
product values and, finally, the Borda count rule is used. Among the rules, the 


product rule achieved a good recognition rate. 


Xiao et al. (2017) proposed a global supervised learning and 
adaptive weight technique for handwritten Chinese character recognition. The 
proposed implementation is compared with the CNN, proposed low 
computation costs, and 30 times faster than CNN. Basu et al. (2010) discussed 


an array of survey methods for pattern recognition applications in ANNs. 


Chaudhuri et al. (2017) proposed a fuzzy-based approach for 
classifying Hindi characters. Feature extraction is done with the fuzzy Hough 
transform and classified using the fuzzy SVM, fuzzy MLP and fuzzy rough 
SVM classifiers. An evaluation was carried out to determine the superior 
classifier that best recognizes Hindi character. Pal et al. (2010) implemented 
an SVM for classifying Bangla and Devanagari texts. Angular information is 


extracted by finding the contour pixels of the character, followed by a zone- 
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wise feature extraction of the circular and convex hull. The system was 


trained using these features and achieved a high recognition rate. 


2.5 


RESEARCH GAP IDENTIFIED 


From the detailed review above, the gaps between the existing and 


proposed work identified are listed below: 


Compared to other Languages, a huge mass of inscriptions in 


Tamil has been found. 


No effort has been made to digitize the copied inscriptions and 


are archive them. 


Inscriptions are copied from stone using the laborious manual 
estampaging method. Further, only 20% of all Tamil 


inscriptions have been copied. 


Estampage acquisition, unfortunately, damages the original 


inscription. 


Stone inscriptions deteriorate quickly owing to field work and 
natural hazards. In addition, estampages preserved in 


document format deteriorate just as quickly. 


Effective automated character recognition techniques need to 
be developed for speeding up digital reading and recognizing 


ancient characters from stone inscription images. 


Comparatively speaking, studies carried out on character 
recognition from Tamil stone inscriptions are so few as to be 
countable. On the contrary, work on stone inscriptions, 
handwritten documents and printed text document images in 


languages other than Tamil is enormous. 
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The difficulties faced in the proposed work as it differs from the 


existing work are discussed below: 


Handwritten document image Proposed stone inscription image 


e Collecting datasets involves only e Collecting datasets is undertaken 


scanning in person from various sites 
e Even illumination conditions e §=Varying illumination conditions 
e No breakage in the characters e Deteriorating stones and damaged 
characters 


e Contrast differences exist between e The differences between the 


the foreground and the background foreground and background are 
insignificant. 
2.6 CONQUER THE CHALLENGES 


The following section describes the need of the proposed system 


and how it handles challenges thrown up by the existing system. 


e The work carried out in the field of character recognition so 
far has been restricted to handwritten and printed document 
images in several languages. As far as inscription images are 


concerned, however, very little work has been undertaken. 


e Inscriptions on stones add to the complexity because of their 
unconstrained nature. Digital acquisition is proposed to 
identify characters on stone. This reduces the time consumed 
and protects the stone from damage brought on by 


estampaging. 


e Compared to document images, there is very little contrast 
difference between the foreground and the background in 


stone inscription images. Hence, an efficient image 
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preprocessing technique needs to be developed to extract 


foreground characters. 


e Miscellaneous segmentation algorithms have been discussed 
in literature reviewed. However, segmenting characters in 
inscription is fraught with difficulty, because of the breakage 
in the character and the fact that the gap between the 


characters is minimal. 


e To recognize and transliterate ancient characters, the system 
must be trained with huge volume of character sets. A 
machine learning technique is to be developed that recognizes 
ancient characters and simplifies manual transliteration and to 


make the work faster than the existing system. 


2.7 SUMMARY 


In the literature reviewed, a slew of preprocessing techniques, 
segmentation methods, feature extraction and pattern-matching algorithms, as 
well as classification approaches, have been discussed. The brief survey 
makes it clear that work on character recognition is concentrated largely on 
clear document images and scanned handwritten images. There is very little 
ongoing research on recognizing characters from real-time images. Based on 
the insights drawn from the discussion above, it is evident that no efficient 
techniques have been developed that recognize characters from stone 
inscriptions. In conclusion, this chapter has summarized the existing system 


and its challenges, and justified the need for the proposed system. 
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CHAPTER 3 


STONE INSCRIPTION IMAGE ACQUISITION 
AND ENHANCEMENT 


3.1 INTRODUCTION 


Image enhancement plays a major role in image processing 
systems, since images that are acquired cannot be used in their raw form for 
such systems. Images are to undergo preprocessing to enhance their quality, 
which helps suppress distortions and highlight their key features. 
Preprocessing helps alter the pixels in an image, based on the neighborhood 
pixels. This chapter briefly explains the image acquisition and preprocessing 
steps proposed to extract characters from stone inscription images by 


eliminating the background. 
3.2 DIGITAL IMAGING 


Image acquisition is the primary step in image processing, which 
involves obtaining the information from a source. Images captured on camera 
factor in two major components: illumination sources and reflection from the 
image being viewed. The general architecture of digital imaging is shown in 
Figure 3.1. Captured analog images are digitized both spatially as well as in 
terms of amplitude. Analog to digital conversion involves two steps, sampling 
and quantization. Spatial resolution is obtained through the sampling rate and 


gray levels through quantization. 
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a 
canine Analog to Digital 


Device 


Acquired 
Digital 


Image 


Figure 3.1 Digital Imaging Architecture 


In general, analog images are acquired through digital camera 
wherein the Charge Coupled Device (CCD) sensors are arranged in a 2-D 
array format, though with high power consumption. Hence an advanced 
camera called the Digital Single Lens Reflex (DSLR) that contains a 
Complementary Metal Oxide Semiconductor (CMOS) Sensor is used in this 
research. The DSLR camera used to capture stone inscriptions helps find the 
depth of the characters engraved on stone, apart from enhancing photo 


quality, speed and flexibility. 
3.3 DATASET ACQUISITION 


The method proposed in this research takes camera-captured 
images as input, particularly inscriptions captured from Tanjore Brihadishvara 
Temple, which were engraved during the reign of Raja Raja Chola in the 11" 
century. It was during this century that characters were standardized and the 


volume of inscriptions at an all-time high. A survey report of the Department 
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of Archeology, Government of Tamil Nadu, concludes that 19,000 


inscriptions in all were engraved from the 11" to 13" centuries. 


(c) 


Figure 3.2 (Continued) 
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(d) 


(e) (f) 


Figure 3.2 (a), (b), (c), (d), (e), (f) Sample of 11" century stone 
inscription images captured using DSLR. 

These inscriptions were captured with a Canon EOS-600D DSLR 
camera, and the images displayed in a 3-in varying angle with a clear view 
Liquid Crystal Display (LCD) screen, which helps take shots from difficult 
angles. An 18-megapixel sensor with a high ISO 6400 was used for low-light 
capture, auto focus and flash capability. In all 250 inscriptions were captured. 


a sample of which is shown in Figure 3.2. 
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3.4 BINARIZATION TECHNIQUES 


The two types of images in use are document images and camera- 
captured ones. Binarizing a document image is easier than binarizing a 
camera captured stone inscription image. A document image can be binarized 
easily unless the document is degraded, because of the contrast between the 
background and the foreground. However, in camera-captured stone 
inscriptions, the foreground and background are the same, with hard, rough 
stony surfaces characterized by blobs and cracks. As a consequence, detecting 
the strokes that constitute an integral part of characters from stone inscription 
is a challenge. Several techniques have been reported that deal with the 
above-mentioned issue. Much of the work carried out uses the Otsu algorithm 
(1979) to reduce the intra-class and inter-class variance. In other words, the 
average of the two classes, called the foreground and background, is 
differentiated as much as possible. Several researchers have used the Otsu 
algorithm (Otsu (1979), Jian Zhuang et al. (1991), Liao et al. (2001), 
Wenaqing (1993)) to binarize images. Though the algorithm separates the text 
character from the background, some character pixels are broken, and hence 
the continuity of the character is not assured. In case of variations in 
illumination, the character cannot be recognized, hence, the resultant image 
pixels end up getting meshed. The proposed method is also compared with 
other binarization techniques, as in Shaus et al. (2012), Sauvola et al. (2000), 
Bernsen (1986), and Niblack (1986). 


A global thresholding algorithm with its fixed threshold values 
gives good results only for images with a contrast between the foreground and 
background. However, in a stone inscription image, single fixed threshold 
values cannot be implemented because there are minimal differences between 
the foreground and background. Consequently, an adaptive thresholding 


method is used to compute the threshold at each local pixel value. 


asta ANNA UNIVERSITY, CHENNAI - 600 025 (4S) 


Digital acquisition of stone inscription image 


54 


Stretch the pixel 
from 0 to 255 


Compute entropy 
H(E) image X 


Foreground 
character separated 
from background 


Noisy binarized 
image 


A Input image I (x,y) 


Linear Contrast Adjustment (LCA) 


Compute 
histogram 


, LCA image I (x,y) 


Proposed system 


Compute entropy 
H(E) for each class 
H,(E)...Ho(E) 


Apply adaptive 
threshold at various 
regions 


Bilateral filter 


Filtered Image 


Scaling the 
pixel 


Compute Gaussian 
membership 
function us(x) 


Determine 
threshold value at 
each sub image 


Noisy detection 


Character edge 
detection 


ore 7 
REL B SSA] PTD 
aOR P PIG OD FF _AEQALO Nee 


DGD FMP_NIGLON sO ARIAS TG 
> 

“RM ROO EBA —- BI TTA AG 

; NS 7 _ Leh eye , 


Figure 3.3 Architecture of the proposed MFEAT-IBF system 
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An efficient solution for foreground and background extraction is 
proposed in this research called Modified Fuzzy Entropy based Adaptive 
Thresholding (MFEAT), it works alongside an Iterative Bilateral Filter (IBF) 
The overall block diagram, which separates characters from the background 
image, is shown in Figure 3.3. The proposed method for extracting characters 


from stone inscription images is discussed in the following section. 
3.4.1 Linear Contrast Adjustment (LCA) 


Enhancing and the visual quality of images helps perfect the 
process of character extraction from stone. The two types of enhancement 
methods available are the spatial domain and frequency domain. The spatial 
domain represents space in a 2D image in the (x,y) plane. It modifies each 
current pixel intensity value so as to adjust it to low contrast or high contrast, 
and enhances the characters engraved on hard stone surfaces by a technique 
called Linear Contrast Adjustment (LCA). For example, in a stone inscription 
image, given that the minimum value is about 60 and the maximum value 
about 190, values between 0 to 60 and 191 to 255 cannot be viewed. Hence, 
values should be stretched from 0 to 255 so that the resultant image has no 


broken characters. In general, Eq. 3.1 is given as: 


loutay) = PUin(%,y)) Gd) 


In Eq. 3.1 I5y¢(%, y) represents the output image which gives the 
modified pixel intensity values and performs point processing operations on 
each pixel intensity, based on the neighborhood pixel intensity. Adjustments 
made to the contrast help resolve differences in luminance, as revealed from 


two neighboring surfaces. 


Step 1: Fix the intensity range of values, P and Q, within a particular limit 


for stretching (where limits are 0 & 255). 
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Step 2: Calculate the histogram of the original image, with the lower limit 


R as the minimum value and the upper limit S as maximum value. 


Step 3: Scaling each pixel of an image, I,,,;(, y) is given as: 


lout, Y) = Cin (x,y) — R) {FI (3.2) 


From Eq. 3.2, the range of intensity values can be stretched, and the 
contrast is significantly improved by achieve better results for further 


processing. The contrast is stretched linearly for all the image samples. 
3.4.2 Existing Binarization methods 


Thresholding is a basic binarization method applied to extract the 
foreground information from an image. As far as stone inscription images are 
considered, no standard binarization method has ever been implemented. 
Hence, as part of a testing process, inscription images were tested with all the 


basic binarization methods detailed below: 


From the literature surveyed, it is plain that the Otsu method was 
tried for stone inscriptions in some papers. Beulah et al. (2015) used global 
thresholding, termed the Otsu method, on Grantham characters to minimize 
the intra-class variance and obtain the threshold value. It takes the 
probabilities of two classes called the reduced intra-class and increased inter- 
class. Through the updating and computing process, the threshold value is 
calculated. The same method, applied on 11" century stone inscription 
images, failed to produce satisfactory results. Suganya et al. (2014) 
implemented binarization techniques on stone inscription images, based on 
local thresholding approaches, such as the Niblack (1986) method, which 
computes local threshold values based on the integral image and the size of its 


neighborhood. This method is applied to clearly-captured small script with no 
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breakage in the characters. Their method binarizes well; however, when 
applied on 11" century inscriptions, it is observed that the character pixel is 
misjudged as the background in images with varying illumination. Similarly, 
Sauvola et al. (2000) and Bernsen (1986) also considered the local threshold 
by calculating the local mean and standard deviation. Based on the mean 
value, the foreground pixel is predicted. Hence, to overcome the issues faced 
in existing techniques, the fuzzy-based method is proposed for binarizing 


stone inscription images in the following sections. 


3.4.3 Proposed Modified Fuzzy Entropy based Adaptive 
Thresholding (MFEAT) 


Image binarization is executed by selecting adaptive threshold 
values from among a range of pixel intensities. It relies on pattern 
classification, in which input features called pixel intensity values are 
considered. In general, a binarized image can be obtained by finding the 
threshold value for text extraction from the background image and allocating 
either the black pixel value (0) or the white pixel value (255) to all gray 
levels, based on the selected threshold. Choosing a threshold value is critical, 
since it plays a major role in feature extraction and pattern classification. 
Hence, the adaptive threshold is chosen to avoid misclassifying character 


stroke pixels as background pixels. 


A fuzzy-based iterative approach is proposed for stone inscription 
images in order to determine the adaptive threshold for binarization. Fuzzy 
(Boolean) logic is based on the fuzzy set theory. Using logical reasoning, this 
technique attempts to imitate the reasoning skill of the human brain. Fuzzy 
logic assigns truth values, say binary | or 0, to a pixel and recognizes 
information pertaining to certain characteristics of the pixel in question. The 
role of fuzzy entropy was discussed by Sesadri et al. (2015) in selecting the 


threshold value from a gray-level histogram, using the entropy concept. In a 
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digital image, an array of fuzzy membership values from 0.0 to 1.0 specifies 
the properties of images, such as varying illumination, character strokes, 
edges, and textures. In general, each pixel in an image has a fuzzy 
membership function that belongs to either of the two binary values, 
foreground and background. Hence, the membership function differentiates 
the gray-level intensity between the foreground and the background. 
Therefore, fuzzy entropy (also termed the degree of fuzziness) can be 
computed from the membership function. The concept of fuzzy entropy is 


explained as follows. In general, F is denoted by 


E= (€1»€2»€3, En) EA, (3.3) 


where, A,= {(€1, 2, €3,-- en le; = 0,i =1,2...n,n = 2, i.e; = 1}. The 
Equation above is the entropy probability distribution of an n-ary finite set. 


The entropy of the image is given by H(E) in Eq. (3.4): 


H(E) = — Le, logze; (3.4) 


where, H,(t),H>(t),...H,(t) are the entropies computed for each sub image 
pixel, which are shown in Equations. (3.5) — (3.10): 


Hy(t) = —Yity “/e, ne, (3.5) 
HQ) SS) ea en “Ve: (3.6) 
HORS a en Ve. (3.7) 
E,(t) = — Dirge: (6) (3.8) 
E,(t) = — Yio 91 8 (3.9) 
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Two dummy thresholds are introduced: tp = 0,t, =L—1. The 
threshold T is given by Equation 3.11, 


T (ty, tz, ..ty) = Argmax([H,(t) + H2(t) +--+ H,(@)]) G.11) 


The set S can have a group of elements, say X € Sor X €S.SetS 
and ,(x) called the membership function, is given by Equation 3.12. 


S = {(xus(x))lx €X (3.12) 


The membership function, represented in terms of curve, defines 
every pixel in the input image mapped to it. The proposed system, 
implemented using the Gaussian membership function of n regions, as in 
Kapur et al. (1985) is given by w1,u2,u3...un. The fuzzy entropy of each 


region is given by the following relation. 


a, © 
Hs (x) = exp (- anita (3.13) 
yee @) 2 = 2 ee) (3.14) 
Hy (0 =-Zibger=* In (* : Be) (3.15) 


E,(t) = ar ej * jig(x) tee By(t) = ee * Un(t) (3.16) 


From Equation 3.13, the adaptive threshold value can be computed 
by maximizing the total entropy value and n — 1 threshold values. The results 
after fuzzy entropy thresholding are shown in Figure 3.4, where noises are not 
removed, so the Iterative Bilateral Filter (IBF) is proposed to preserve 


character edges and eliminate unnecessary background pixels from the image. 
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(c) 


Figure 3.4 (Continued) 
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(e) (f) 


Figure 3.4 (a), (b), (c), (d), (e), (f) The binarized image using the MFEAT 


3.4.4 The Proposed Iterative Bilateral Filter (IBF) 


Noise in an image is a random and usually unnecessary variation in 
brightness or color information, caused by sensor noise, which produces 
channel transmission error. Filtering is a preliminary process in character 
recognition, and helps suppress high frequencies by smoothing the image, or 
low frequencies by enhancing it. With this process, the edge pixels of 
characters999 can be detected. Compared to the linear filter, the nonlinear 


filter preserves edges. A bilateral filter (Tomasi et al. 1998) is an edge- 
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preserving nonlinear filtering technique that reduces noise and smoothens the 
image. The intensity value at each pixel is replaced by a weighted average 
value of the intensity values of the pixels nearby. This weight is calculated by 
the Gaussian distribution, denoted by W,. The weights depend on the 
Euclidean distance and the range difference, denoted by Wp. This weight 
preserves the sharp edges of the character pixel by systematically looping the 
edges through each pixel and adjusting the weights with the successive pixels 
accordingly. Let X; ; be the input pixel intensity values, Y;,; the output pixel 


values, and N the number explained through Equation 3.17 


_ Yap=-n Wo (Gb)Wr(Gb)xi+a,j+b 


oj = SN Wo (@d)We ab) ne 
where, 
W, (a, b) = exp — oun (3.18) 
R 
We = exp — i inoue” sc (3.19) 


2 
20R 


The noisy pixels present in the binarized image are located by the noise 
detector and distinguished from character pixels. It also chooses whether or 
not the current pixel has to be filtered. Since the resultant output also contains 
some noise, the system is iterated three times. The noise is completely 


removed to give a clear edge-preserved character, as shown in Figure. 3.4. 
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Figure 3.5 (Continued) 
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Figure 3.5 (a), (b), (c), (d), (e), (f) The filtered image using the IBF 


Detection is based on robust estimation, followed by filtering. The 
noisy pixel is identified by finding the differences between the current pixels, 
and the reference median pixel is calculated to detect noise. If the differences 
are large, the pixel is considered noisy; it is otherwise a noise-free pixel. The 


resulting image is shown in Figure 3.5. 
3.4.5 Experimental Results and Discussion 


Digitally acquired 11" century stone inscription images from the 
Tanjore Brihadeeswarar Temple are taken as input images. This is because the 
unique features and the style of writing Tamil characters evolved in the 11" 
century and became standardized by the 19" century. All the inscriptions in 
the temple are captured using a digital camera. Clear scripts that have not 
deteriorated, and show up in good light on a phone camera are considered. 
Stone inscriptions have hard surfaces, and illumination is the only parameter 
that distinguishes the texture of the character. In all, 250 images were handled 
in this work. The proposed system for character extraction from stone 


inscriptions is designed by combining the MFEAT with a degree of the 


7 ) 
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Gaussian membership function and the Iterative Bilateral Filter (MFEAT- 
IBF). The result of the proposed method is evaluated on various evaluation 
metrics and achieved good results, compared to other binarization techniques 


on camera-captured stone inscription images. 


However, other binarization techniques face a problem in that 
either the noise is not entirely removed or there is a breakage, termed a 
discontinuity, in the character. If the breakage persists in all the characters, 
segmentation cannot be undertaken, and neither can further processing like 
feature extraction and classification be carried out. In the proposed system, 
the breakage in the binarized characters is reduced, as shown below. The 
results of the proposed MFEAT-IBF system are shown in Figures 3.5 to 3.8 b, 
h, n, and t. The output image is clear with no noise, and the characters are 
clear and complete. Figures 3.5 to 3.8 a, g, m, and s show a sample camera- 
captured original image that undergoes processing to separate the foreground 
character script from the background. The result of the Otsu method is shown 
in Figure 11 c, i, o, and u; Sauvola in Figures 3.5 to 3.8 d, j, p, and v; Niblack 
in Figs. 3.5 to 3.8 e, k, gq, and w; and Bernsen in Figures 3.5 to 3.8 f, 1, r, and 
x. In these figures, noise is added and character pixels broken, showing 
discontinuity, and the pixels are meshed, showing meaningless character 
retrieval. Such results culminate in a poor recognition rate. Hence, the 
proposed method extracts characters, without breakage, clearly from the 


background. 


‘aaa’, ANNA UNIVERSITY, CHENNAI - 600 025 (4s) 


was Sasa Ve 
ci9 APS ATOR SBE ATA 


(c) 


sang q Gear agagage Say Saas 
ta " > DDE ith 

Tsaunjaley aang 4G) hikes atone si 
we PAaAAT AMA NTH 393 epee ag- ra 
AACA NII AMAA |, MAN 

a ioe JHE o) im a seid fonsy ‘h Raa ASY i 

Biyayaisishet ads SAO SPUIAI] 

Li Liga a rss pSvg ~ io daaawetg ANIA it 


(e) 


Figure 3.6 


Ag joo Raa \ Ses 


(a) Sample test Image 1: 


66 


gang is) sgasas Seeing) fh 
manessjaheaarzoay Breneess) 
BURNT] OT LAST] MUZA AMF PEI = 3 
HHMI NIZA IMAMMGED. AAR 
® PAgeg am Anau yeng aie Roane. 
eee suPjovestosFaayy]\49031 
BMAP OSG BIO TWRAAMHANAZOLYM AT 


(b) 


VISTAS Ng avis 


lis {oa 1s rig? y 
a Sa ai wi) | eiren ey via 
i" PCA 
patie ANG ee 184 ly ey 
SPRAY Asch ayy. ual PQA 
EP AID AL BY OUT AAA AT\Y: AA 
3 4 FSAI ASABE L FA 


co | ann ae dt 


(d) 


oF Tay 


gana) Sasi saagoaeOmey east 
cagA MONDIAL PTOI ROE 

Jou pSEpSAIy TOBA gM FPR. Bh 
eae N33] aaa aaa 
i PRAYED. ASHEN Ayes rayey 
De m ASG AUS Canvas yo" Raayyyu a M 
uw um Lee) - A TaaO!s AQHA 1 


(f) 


Binarized image _ results 


(b) The proposed MFEAT-IBF (c) Otsu (d) Sauvola 
(e) Niblack (f) Bernsen 
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Figure 3.7 (g) Sample test Image 2: Binarized image results (h) The 
proposed MFEAT-IBF method, (i) Otsu (j) Sauvola (k) 
Niblack (1) Bernsen 
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Figure 3.8 (m) Sample test Image 3: Binarized image _ results 
(n) The proposed MFEAT-IBF method (0) Otsu (p) Sauvola 
(q) Niblack (r) Bernsen 
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Figure 3.9 (s) Sample test Image 4: Binarized image results (t) The 
proposed MFEAT-IBF method (u) Otsu (v) Sauvola (w) 
Niblack (x) Bernsen 


The proposed MFEAT-IBF method overcomes the difficulties 
faced by other binarization techniques. When the proposed method is applied 


ANNA UNIVERSITY, CHENNAI - 600 025 Ay ) 


70 


to badly deteriorated images, it misjudges the character pixel as a background 
pixel. certain image reconstruction methods must be applied before such 
images are subjected to our proposed method. The result is evaluated, based 
on the statistical behavior of the image, in terms of the following: (1) Mean 
(M), which computes the average of the pixel within the selected window, say 
mxn, and replaces the center pixel value; (2) Standard Deviation (SD), which 
is a measure of variation with respect to the mean, and shows the character 
edge sharpening at higher values; and (3) Peak Signal-to-Noise Ratio (PSNR), 
which measures the quality of the resultant image with respect to the 


reference image, called the input image, in a logarithmic scale dB (decibel). 


3.4.6 Performance metrics analysis 


The performance of the proposed system is analyzed through 
metrics such as the mean by computing the average gray levels of the image; 
Standard deviation; PSNR; and the computation of the mean yw in Eq. 3.20 
that denotes the gray-level pixel width W and height H, with g(x, y) denoting 
the gray value. Standard deviation o is computed using Eq. 3.21. The PSNR 
values in Equation. 3.22 are computed by finding the difference between the 
original image X of size (m X n) pixels and the reconstructed image X’ . 
Table 3.1 compares the results of the proposed method with other methods, as 


shown below: 


= Ggler Dag @y) (3.20) 
a= |W veg y) — u)? (3.21) 
2552 


PSNR = 10log 5 
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Table 3.1 Comparison of the result of the mean, standard deviation, and 
PSNR for the sample images shown in Fig. 3.5 to 3.8 between 
the proposed system and other binarization techniques 


Methods Mean (M) Standard deviation (SD) PSNR 
Image 1 (Clear script with dark background) (Fig 3.2 a) 
Bernsen 0.8723 0.091 25.89 
Niblack 0.8573 0.0578 32.87 
Sauvola 0.6245 0.0657 27.54 
Otsu 0.8881 0.0476 43.54 
Proposed 0.4673 0.0145 45.75 
Image 2 (Clear script with light background) (Fig 3.2 b) 
Bernsen 0.6547 0.0843 21.78 
Niblack 0.8478 0.0674 35.73 
Sauvola 0.5784 0.0457 29.64 
Otsu 0.9758 0.0204 39.96 
Proposed 0.3257 0.0369 48.01 
Image 3 (Deteriorated characters) (Fig 3.2 c) 
Bernsen 0.7532 0.0945 9.53 
Niblack 0.9742 0.0571 17,95 
Sauvola 0.6712 0.0342 19.89 
Otsu 0.8703 0.1023 35.67 
Proposed 0.2567 0.0156 47.78 
Image 4 (Deteriorated characters with dark background) (Fig 3.2 d) 
Bernsen 0.8534 0.0937 10.72 
Niblack 0.8479 0.0593 15.73 
Sauvola 0.3567 0.0678 20.76 
Otsu 0.9634 0.0318 30.62 
Proposed 0.4782 0.045 48.92 


A comparison of the results of the PSNR values for various 
binarization methods and the proposed MFEAT-IBF is shown in Table 3.1. 


Compared to other binarization techniques, the proposed method offers good 
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PSNR values and foreground character extraction. The proposed fuzzy 
thresholding method outperformed the Otsu, Niblack, Bernsen, and Sauvola 
methods. Though the captured images vary in terms of illumination and stone 
color, the proposed method binarizes and extracts foreground characters with 


a good recognition rate. 
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Sample stone inscription Images 


Figure 3.10 A comparison graph for the PSNR values 
i a SUMMARY 


Very little research has been carried out on stone inscription 
images, and an efficient preprocessing method that binarizes images for 
character extraction is lacking. The proposed system has been presented in 
this chapter. It combines modified fuzzy entropy-based adaptive thresholding 
with the Gaussian membership function and iterative bilateral filter, and 
differentiates foreground characters from the background on 11" century 
stone inscription images efficiently. Character extraction is quite easy in 
images with a contrasting foreground and background, as in the case of 


document and handwritten images, whereas stone images have very low 
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extraction rates, owing to their undifferentiated foreground and background. 
Other complications include varying lighting conditions while capturing the 
image, writing styles, natural stone color, breakage in the stone, and 
characters with deteriorated portions. For such applications, existing global 
and local thresholds do not work, since the global threshold works only on 
images with contrast differences in the foreground and background. In the 
local threshold, there are different values for each pixel, and hence it 
misclassifies a foreground character as a background one. Hence, the 
proposed method of adaptive thresholding using a fuzzy-based approach with 
an iterative bilateral filter has helped achieve a better PSNR value than other 


existing techniques. 
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CHAPTER 4 


INSCRIPTION CHARACTER SEGMENTATION 


4.1 INTRODUCTION 


Image segmentation is the process of decomposing a complete 
image into multiple segments. It assigns labels to every pixel that help detect 
boundaries, based on characteristics like intensity or color. The detected 
regions are strongly related in terms of features of interest. Pixels in the same 
region have the same intensity values and form a connected region. This 
chapter proposes a segmentation algorithm for segmenting individual 


characters from binarized stone inscription images. 
4.2 SEGMENTATION: AN OVERVIEW 


Selecting an algorithm is a key factor that determines the level of 
detail present in an image and, further, decides the accuracy in the recognition 
rate of the system. Basically, character image segmentation is subdivided into 


a three-step process, as follows: 


e Line segmentation 
e Word segmentation 


e Character segmentation 


Five major approaches are involved in image segmentation: 


classical, explicit, implicit, hybrid and holistic. 
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Classical approach — It divides images into sub images, using a 


process termed dissection. 


Explicit approach - Images are classified, based on prespecified 


windows. 


Implicit approach — Spatial features are extracted, from which 


the whole image is classified. 


Hybrid approach — Images are dissected, based on 


recombination rules. 


Holistic approach — Words are recognized as a whole. 


In a stone inscription image, segmenting a single complete 
character is a challenge, given the unconstrained nature of the characters 
themselves. Insights have been drawn from several studies on handling 
handwritten document images for a range of languages. This research 
proposes to segment stone inscription images in a two-phase approach 


comprising line segmentation, followed by individual character segmentation. 
4.3 PROBLEM DEFINITION 


Handling stone inscriptions is a challenge, because of their complex 
background. Despite the preprocessing carried out, decomposing _ script into 
lines, followed by characters, is difficult. This is because the characters are 
inscribed on hard stone surfaces and the nature of curvature shapes is to be 
factored in as well. The challenges involved in segmenting stone inscription 


characters are listed below: 


e Noises in the background are also predicted as character pixels. 
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e Characters are occasionally broken on account of illumination 
while being captured using a digital camera, and _post- 


preprocessing as well. 
e There are characters that overlap and touch in portions. 


e Differences in intensity, pointing to irregularity, show up within 


characters. 


e Non-touching segments in consonants, referred to as upper 


curves, are to be dealt with. 


4.4 IMPLEMENTATION OF CONNECTED COMPONENTS 
WITH THE BOUNDING BOX ALGORITHM 


The binarized image, comprising continuous structured characters 
with minimal space between each, is taken as input. The image with the 
complete script is to be segmented into separate characters, and this is done 
through separating each character by finding a boundary. The connected 
components are identified by grouping pixels, based on their connectivity. 
Thereafter, pixel groups that share the same intensity values are labelled. The 
image is scanned row-wise by the labeling operator until it finds groups of 
neighboring pixels. After fixing the boundary, the outer edges of a character 
are detected. Here, 1- and O- pixels refer to pixels with values of 1 and 0 
respectively. Four connected pixels make up a pair of pixels which are 
adjacent to each other, where one pixel is to the top, bottom, left or right of 
another pixel. Similarly, in 8 connected pixels, one pixel can be diagonally 
adjacent to another pixel, apart from the positions mentioned in the 4 
connected pixels. A connected component is a continuous set of 1- or 0- 
pixels which are 8/4 connected with each other. The image is raster-scanned 
so pixels that are connected are scanned prior to their arrival at the empty 


space where the boundary is detected. The surroundings of the connected 
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pixel are marked with the bounding box. Similarly, it continues to scan the 
next character in the image. The implementation of the proposed system on 
the preprocessed stone inscription image involves finding the connected 


components, followed by the bounding box. 


Algorithm Steps: 


Input: Preprocessed binary image 


Output: Labelled image 


Step 1: Input binary image 
Step 2: 8-connectivity is used for labelling the input image 
Step 3: Run length scans the black and white pixels in the input image 
Assigns the label and records in equivalence table 
Identify equivalence classes 
Relabeling the runs from the equivalence classes 
Step 4: The above steps identifies the connected components in the input 
image 
Step 5: To segment each character a bounding box is plotted 
Step 6: The vectors of indices is found to locate complete character pixel 
Step 7: Based on indices the coordinates of the character is identified 


Step 8: Repeat step 6 and 7 until the connected components ends. 
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Figure 4.1 (a) Sample inscription image, (b) Segmented characters 
using the bounding box 

The literature surveyed makes it plain that the most common technique 
applied on inscription images is the bounding box and manual cropping of 
characters. The implementation process on the stone inscription image 
involves finding the connected components with the bounding box. The 
drawback in applying the process to the inscription image is shown in Figure 
4.1 (b). The red arrows represent the segmented image that the single 
character is divided into multiple parts. There are two problems with finding 
the connected components for a stone inscription: (1) Consonants with an 


upper curve are segmented and stored separately, which tends to change the 
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meaning of the character. (11) Broken characters lead to multiple segmentation 


of a single character. 
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Figure 4.2 Images stored after segmentation 


Figure 4.2 shows images segmented and stored in the database 
using the bounding box algorithm. Another drawback of this method is that 
segmented characters are stored randomly. They get shuffled, which creates 


difficulties during transliteration. 


4.5 CALLENGES HANDLED BY THE PROPOSED 
METHODOLOGY 


A new technique is proposed for segmenting stone inscription 


characters which automatically segments lines, followed by characters. Poor 
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quality, noise, and age-related degradation continue to remain challenges in 
recognizing characters from stone inscriptions. The literature reviewed makes 
it clear that the related work on Tamil stone inscription images, as it pertains 
to our research, is rather limited when compared to work in languages like 
Kannada, Devanagari, Brahmi, and Gurmukhi. The proposed method 
overcomes the drawbacks encountered in existing methods, particularly as 


they relate to the segmentation of inscription characters. 


Non-touching upper curve for consonants 


Much less spacing among characters in horizontal lines. 


gir enki nae Pe eg 
nie qopoyery2anSulnee aang 
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Unconstrained nature of the character E.g. ‘va’. (many variations in 


single character) 


Skewed character 


was aE Gy 


Irregular intensity among the characters. 


a ONAL 


A variable thresholding technique is proposed for line and character 


segmentation and constitutes the following: 
e Horizontal Baseline Segmentation (HBS) for segmenting lines. 


e Vertical Character Segmentation (VCS) for segmenting lines 


into single characters. 


e Feature-based segmentation for non-touching characters with 
minimal spacing, by means of line and curve extraction, using 


the Progressive Probabilistic Hough Transform (PPHT). 


4.5.1 Horizontal Baseline Segmentation (HBS) 


The complete binarized inscription script is taken as input. Here, 


since the foreground character is a black pixel, say ‘0’ and the background 
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character is a white pixel, say ‘1’, the intensity values are clustered into two 
dominant modes. Since the script has clear spaces between every line, it can 
be decomposed into separate lines by considering white pixels on selecting a 
threshold value, ‘T’. Let us consider script as ‘S’, as shown in Equation 4.1, 


where S € {0,1}. 


i=12,..7 
54j = li =12,..¢ a 


where r, c denotes white pixels along the rows and columns. All white pixels 
running through rows ‘r’ are added up to give the output, ‘Y’, as shown in 


Equatio 4.2: 
Y= Vier Sep) = U2. (4.2) 


Computing the average of the pixels in every row gives a particular 
value. Similarly, the process is continued for all the rows in the script, starting 
from the top left to the bottom right horizontally. After computation, it is 
noticed that the horizontal line space between the characters along certain 
rows has the maximum white pixel values, compared to lines where 
characters are present. The mid-horizontal line of maximum white pixels 


along row is detected to generate a histogram plot. 
HL = (AL €Y,|¥, < 1,j = 1,2...c) (4.3) 


To segment the line, the maximum threshold value is computed, 
based on the histogram plot. The plot peaks only for lines that have a 
maximum value of white pixels along the rows. It is thus that the proposed 


HBS algorithm helps segment lines. 
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Algorithm steps: HBS 


Input: Full binarized script image 


Output: Horizontal line segmented image 


Step 1: Preprocessed image is scanned horizontally 

Step 2: Count number of white pixels in a row in x direction 

Step 3: Sum all the pixels in every row 

Step 4: — The space in between each line shows the majority of white pixels 
Step 5: Plot histogram based on the character projections and white pixels 
Step 6: From the histogram the adaptive threshold value is identified 

Step 7: Selects the mid white pixels row and segment the line 


Implementation of an algorithm: (Line segmentation) 


Begin 


def fun(image): 


arr=np.asarray(image) 


tem=[] 
for iin range(image. width): 
tot=0 
for] in range(image. height): 
tott+=arr[j][i] 
tem.append(tot) 


return(tem) 
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def index(arr1): 
ma=0 
index=0 
for iin range(len(arr1)): 
if(arrl [i] >ma): 
maz=arrI1[i] 
index=i 
return index 
def ma_(arr): 
print("came to ma") 
tem=[] 
print(len(arr)) 
plt.plot(arr) 
plt.show() 
thresh=thresh_adaptive(image, offset=15) 
for i in range(len(arr)): 
if(arr[i]>num): 
tem.append(i) 
k=[] 
for iin range(len(tem)-1): 
if(tem[i+1 ]-tem[i]>10): 


k.append(i) 


a 
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k.insert(0,0) 
k.append(len(arr)-1) 
avg=[] 
for iin range(len(k)-1): 
avg.append(tem[index(tem[k[i]:k[i+1]])+k[i]]) 
print(avg) 
return(avg) 
def corp(imagel,arr,s): 
print("came to corp") 
arr.insert(0,0) 
print(arr) 
for iin range(len(arr)-1): 
img=imagel.crop((arr[i],0,arr[i+1 ],imagel.height)) 
sl=st+'/"+str(i)+" jpg" 
file=open(s1,"w+") 
img.save(file) 
sl=str(input("enter the path")) 
file=os.listdir(s1) 
asdas=file.pop(0) 
print( "file" file) 
image=[] 


img_arr=[] 


a 
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import numpy as np 
for i in range(len(file)): 
image.append(Image.open(s1+"/"+file[i]).convert("L")) 
img_arr.append(np.asarray(image[i])) 
image=[] 
img_arr=[] 
import numpy as np 
for i in range(len(file)): 
if(file[i}[-3:]=='jpg'): 
image.append(Image.open(s1+"/"+file[i])) 
img_arr.append(np.asarray(image[i])) 
for i in range(len(image)): 
kel=fun(image[i]) 
print(max(kel)) 
print("image",i) 
line=ma_(kel) 
try: 
os.mkdir(s]+"/"+"LC"+str(i)) 
except: 
print( "already present") 


corp(image[i],line,s1+"/"+"LC"+str(i)) 
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4.5.2 Results and Discussions of HBS 


The implementation algorithm above is applied over the preprocessed 
image, as seen in Figure 4.3 (sample image 1) and Figure 4.7 (sample image 
2). The input image is raster-scanned to find the most number of white pixels 
on the horizontal line. Raster scanning helps subdivide the complete script 
into separate lines automatically by detecting the threshold values, as shown 
in Figure 4.6 and Figure 4.9. Their corresponding histogram plot is given in 
Figure 4.4 and Figure 4.8. Every peak attained in the histogram plot shows the 
horizontal white space along the rows, and a screenshot of the threshold 
values for segmenting lines is generated automatically, as shown in Figure 4.5 


and Figure 4.9. 
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Figure 4.3 Sample image 1: Input image for line segmentation 
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Figure 4.4 Image 1: Histogram plot 
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Figure 4.7 Sample image 2: Input image for line segmentation 
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Figure 4.10 Image 2: Line segmented image using HBS 
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4.5.3 Vertical Character Segmentation (VCS) 


The line-segmented script image is given as input. Similar to the 
HBS, the VCS is proposed here to segment the character where the number of 
white pixels running vertically (that is, the column pixels) is summed up. This 
logic is applied, considering that the inscribed characters have the minimum 
white space between them. Apart from the character’s black pixels, the 
complete white pixels along the columns give the maximum value. A 
common threshold point is given by the histogram plot which helps segment 
the character. The algorithm is designed such that the segmented characters 
from each line will be stored in a separate folder in the database. The 
advantage of this algorithm is that characters are stored in the same order as in 
the input image. Sub-consonant characters are also segmented correctly. 
Whereas, in the traditional bounding box and connected components 
algorithm, characters are shuffled and single compound characters are split 


into two. 


Algorithm steps: VCS 


Input: Line segmented image 

Output: Character segmentation 

Step I: line segmented image is scanned vertically 

Step 2: Count number of white pixels in a column in y direction 

Step 3: Sum all the pixels in every column 

Step 4: The space in between each line shows the majority of white pixels 
Step 5: Plot histogram based on the character projections and white pixels 


Step 6: From the histogram the adaptive threshold value is identified 
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Step 7: Selects the mid white pixels column and segment the character 
Implementation of an algorithm: (Character segmentation) 
img_=[] 
def load_img(path): 
files=os.listdir(path) 
img=[] 
for iin range(len(files)): 
tem=Image.open((path+"/"+files[i])) 
img.append(np.asarray(tem.convert("L"))) 
img_.append(Image.open((path+"/"+files[i])).convert("L")) 
return img 
def sum_(img): 
tot=[] 
for iin range(len(img)): 
tot.append(sum(img[i])) 
return(tot) 
def crop(img, tup, path): 
print(tup) 
img.crop(tup).save(path) 
def returning(tem, threshold): 
teml=[] 


tot=0 
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93 
index=0 


for iin range(len(tem)): 


if(tem[i]>threshold): 
tot+=1 
index+=1 
elif(tot!=0): 
tem1.append(tot/index) 
index=0 
tot=0 
return(tem] ) 
string_file=str(input("enter files name")) 
images=load_img(string_file) 
string_store=str(input("enter the path to store")) 
for i in range(len(images)): 
tem=sum_(images[i]) 
plt.plot(tem) 
plt.show() 
thresh=thresh_adaptive(image, offset=15) 
teml=returning(tem, thresh) 
tem1.insert(0,0) 


print(tem1) 
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for] in range(len(tem1)-1): 
crop(img_[i],(0,tem1[j],len(images[i][0]),tem1[j+1]),string_store+"/"+str(i) 
+str(j)+" jpg") 


4.5.4 Results and Discussions of VCS 


The line-segmented image from Figures 4.10 and 4.12 is given as 
input to the VCS, and the algorithm finds the vertical white space along the 
column. Similar to the HBS, the histogram plot is generated for each input 
line given. From the majority of peak values, the threshold value computed 
helps segment the line script into single characters, as shown in Figure 4.11 
and 4.13. When the VCS is applied, overlapped or minimally-spaced 
characters are not segmented individually, as shown in Figure 4.14. The VCS 
thus affects automated segmentation. Features are extracted from each 
character, and based on these features, characters can be segmented, as 


explained in the following section. 


Figure 4.11 Image 1: Complete script segmented into individual 
characters 
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Figure 4.13 Image 2: Complete script segmented into individual 
characters 
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Figure 4.14 Image 2: Character Segmentation 


4.5.5 Feature based segmentation using PPHT 


The Progressive Probabilistic Hough Transform (PPHT) is used to 
extract lines and curves from the character image. The PPHT is independent 
of character size, shape and noise, and minimizes computation time. Features 
extracted by the PPHT are used to segment overlapped characters. The PPHT 
is proposed for stone inscriptions to detect lines and curves in characters with 


very little space between them. 


Algorithm steps: Line Detection by PPHT (LDPPHT) 


Step I: Read the segmented character 
Step 2: Find the edges of the image using canny edge detector 


Step 3: Find vectors for the line x1, 22, yl, v2 


aaa ANNA UNIVERSITY, CHENNAI - 600 025 (4s) 


97 


Distance resolution in pixel p 
Angle resolution in radians @ 
Step 4: — Fix the threshold value 


Step 5: Find the minimum length of the line and remaining will be 


rejected 
Step 6: — Find the maximum line gap between the line joining 
Step 7: Detect corners by aperture type and border detection 


Step 8: Detected will be displayed in the output 


Algorithm steps: Curve Detection by PPHT (CDPPHT) 


Step I: Read the input segmented image 


Step 2: Applying Gaussian blur for avoiding false circle detection 
Step 3: Circle detection by storing 3 vector values called x, y, z 
Step 4: Set the inverse ratio as ir=1 

Step 5: Find the minimum distance from center of the circle 
Step 6: Set upper threshold as ut=100 

Step 7: Set center detection threshold as ct=200 


Step 8: Detect the minimum and maximum radius of the circle. 


Step 9: Draw and display the detected circle. 
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Figure 4.15 Segmented result from VCS 


In Figure 4.14, the red arrow shows segmentation results after 
applying the VCS. Since some characters overlap as shown, they are grouped 
together and segmented. To resolve this problem, the features of the 
characters are extracted and, based on the curves and lines, the characters are 
separated. Shown below in Figure 4.16 and 4.17 are samples of characters 
that shows lines and curves extracted with the applying of the PPHT. Based 


on the count of features, overlapped characters are segmented separately. 


Figure 4.16 Overlapping characters 
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Features \ | \_ ie D 


identified 


Figure 4.17 Samples of line feature extraction 
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Figure 4.18 Samples of curve feature extraction 


The features of each character are extracted, based on which 
overlapped characters are separated. The line and curve extraction provide 
details of each individual character, and the extracted features are stored in 
the database. When an overlapped character is given as input to this 
algorithm, it first predicts the features before comparing it with features stored 
in the database. Once the complete character is predicted, overlapped 


characters are subdivided into single characters each. 
4.6 SUMMARY 


As far as stone inscription are concerned, segmentation carried out 
applying manual cropping makes it difficult for a comparison to be drawn 
against the other existing technique. Much of the research has focused solely 
on document and handwritten images, and very few on inscription images. A 
majority of studies have worked on connected components with the bounding 
box algorithm. Very little work has been carried out on inscriptions, and that 
in languages other than Tamil. No strong segmentation algorithm has so far 
been proposed for Tamil stone inscription images. For the most part, images 
have been cropped followed by feature extraction and classification. No 
automated segmentation was implemented to segment inscription characters. 


The proposed approach, therefore, is the first initiative of its kind. It involves 
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the HBS, VCS and feature extraction-based on PPHT segmentation, and has 
been compared to the connected components algorithm. Comparatively 
speaking, the proposed approach offers a new solution to the automatic 
segmentation of inscription characters. Further, the drawbacks faced in 
implementing the connected components algorithm have been resolved by the 


proposed approach. 
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CHAPTER 5 


CLASSIFICATION BY PATTERN MATCHING 


5.1 INTRODUCTION 


This chapter deals with the proposed machine learning model that 
recognizes and classifies ancient stone inscription characters. Pattern 
recognition is a key component of any recognition system used for data 
processing and the corresponding decision making. It is a discipline that trains 
machines to perform perceptual tasks, just like humans. There is a practical 
need to find an efficient way to recognize characters from handwritten images 
and real images. Pattern recognition algorithms identify matching patterns for 
input images with pre-existing patterns by considering their statistical 
variations. Pattern matching is the process of categorizing a set of pixels to 
determine the presence of patterns from raw data. In this chapter, character 
recognition from stone inscription images is tested with the Speeded-Up 
Robust Feature (SURF) and the proposed Modified Speeded-Up Robust 
Feature with the Bag of Graphemes (MSURF-BoG) machine learning models. 


ha a AN OVERVIEW OF THE PROPOSED PATTERN 
MATCHING SYSTEM 


The system aims at identify all possible character pixels in the 
given input so as to provide a textual description of the characters. The dataset 
handled in this work comprises 11" century stone inscription image 


characters. The Image-based Character Pattern Identification (ICPI) system is 
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broadly divided into two phases, training and recognition. In the training 


phase, characters are manually trained and stored into the database. 


Preprocessed stone Individual character 


Database 


‘ inscription image 


Feature extraction 


Detect Slicing the Create code word Histogram 
strongest images into by Bag of plot 
interest points patches Grapheme 


If strongest key 
points detected? 


Create a template 
Similarity matching 


Cross- Character not 


correlation value recognized 


Character recognized 


Figure 5.1. Architectural flow of the Image-based Character Pattern 
Identification (ICP system. 
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In recognition phase, feature extraction is carried out on the 
preprocessed stone inscription image. This is followed by similarity 
estimation and recognizing characters by matching with the training phase. 


The architectural flow of the ICPI system is shown in Figure 5.1. 
5.2.1 Image-based feature extraction 


Feature extraction finds abstractions in image pixel information 
and_ decides which pixel points have information on the relevant features. It 
starts deriving feature attributes from the initial set of input data, and 
subsequently learns character features by means of generality steps, which 
surpasses human interpretation. Character recognition using the Speeded-Up 
Robust Feature (SURF) algorithm developed by Bay et al. [3] constitutes 
three stages: (i) Feature Point Detection (FPD), (ii) Confined Region 
Description (CRD), and (iii) Feature Matching (FM). FPD is the process of 
locating the strongest points, called interest points, on character edge pixels 
by eliminating the background pixel. The SURF constructs the scale space as 
a uniformly-spaced level of octaves. The integral image is constructed, on 
which the 2" derivative filter is applied. An integral image is constructed to 
up sample the filter rather than down sample it. The local maxima sample 
space is found, on which quadratic interpolation is applied. The integral 


image is given by Equation 5.1 
1(X) = Dio tiny) (5.1) 


where, J, (i,j) is the input image and /(X) is the obtained integral image, 
where each point at X is (x,y)!. The integral image is calculated, based on 
the response of the rectangular area, say P,Q,R,S as shown in Eq. 5.2. The 


rect area property performs fast convolutions with different-sized box filters. 
rect =P-—-Q-R+S (5.2) 
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Blob detection is done, based on the Hessian matrix, by deriving a 
scalar-valued function through partial derivations. In general, the matrix is 
given by Equation 5.3. The determinant value helps find the maxima and 
minima values by second-order derivative. If the determinant value is 


negative, the point is not local extremum, and if it is positive, the point is 


extremum. 
07h 07h 07h 
QZ2 AZZ, —S—si« Lp 
H= : *, : (5.3) 
07h 07h 07h 
OZn0Z, O2Z,0Z, ~+«—aaz 


The derivatives are calculated by convolution with kernel, which 
are constructed for the Gaussian matrix x,y. Four entries of the Hessian 
matrix is constructed alongside. The use of the Gaussian helps with the 
smoothing during the convolutions. Let us now construct a Hessian matrix, 
H(q). Let q = x,y be the scale space point, and G,,.(q) be the convolution of 
the 2" order Gaussian derivative given by Equation 5.4. The interest points 


can be detected by finding the Hessian matrix at every pixel. 


Gxx(Q,P) Gxy(q, D) (5.4) 


ch tea Gyy (4p) 


A box filter of a 9 x 9 matrix is considered for the approximation of 
the Gaussian function. The strongest points can be found from the Gaussian 
scale value and here it is set at 0.5, because the SURF can handle only the 
lowest scale space. The strongest 128 points are identified for each character 
and stored in the database. For testing an image, the strongest points will be 
identified and compared with the trained image stored in the database. Once 
the features are matched, the SURF labels the output with the transliterated 


version of the Tamil alphabet through manual labelling. Hence, here it carried 
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out a feature-to-feature comparison. The SURF algorithm for the training and 


testing phases is given below. 


Algorithm steps: SURF algorithm 


Feature extraction in training phase 
Read the file; 
Scaling Factor = 0.5; 
image RGB = imresize (read image (character, ext), scaling Factor); 
total images = numel (character); 
for 
n= |: total images do 
boxchar1 = imread (input file name); 
boxchar = rgb2gray(boxchar1); 
strongest key points = SURF Features detect (boxchar); 
plot (select Strongest (points, 100)); 
[Features, Points] =extract features (boxchar, Points); 
end for 
Feature extraction in testing phase 
char = dir (full folder (image char folder, .jpg); 
total images = numel (char file); 
char imagel = imread (char file); 


char image = rgb2gray(charimage1); 
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strongest points = detect SURF features (char image); 
plot (select strongest (strongest key points,300)); 


[char features, strongest key points] = extract features (char image, 


strongest key points); 
Matching Features 
box Sets = match features (box features, char features); 
if 
is empty (box sets) 
then display (No match); 
else 


matched strongest key points box = strongest key points box 
(box sets (:1), :); 


matched char points = char points (box sets (:2), :); 
figure; 
show; 


matched features (box char, char image, matched strongest 


points box); 


endif 


Using the SURF algorithm, features are extracted for each character 
as well as for the different orientations of the characters. ‘n’ number of 
comparisons are needed, after a search of the whole database, to recognize a 
particular character. Though the SURF algorithm is able to detect features and 
recognize characters, the time complexity is high. This is because it searches 


the entire database and compares all the characters and their variations. 
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Hence, to reduce the time complexity, the SURF algorithm is upgraded with 
the creation of a code word model for each character called the Bag of 
Graphemes (BoG), resulting in the MSURF-BoG. The steps of the proposed 


algorithm are shown below. 


Algorithm steps: BoG Algorithm 


Proposed BoG Algorithm 
Step1 : Begin 
Step2 : Extract and Store the different variations of single characters ‘D’ 
Step3 : Fix the proposition of training & testing module (0.7, 0.3) 
Step4 : Slice the image based on the SURF feature 
Step5 : Construct the Histogram of images 
Identify SURF points in each image in the folder 
Group SURF points to construct small set of patches 
Count the number of patches 
Plot histogram bar 


Step6 : Select top 500 histogram to construct common visual code 
for group characters 


Step7 : Construct SURF points for test image 
Step8 ; Construct code word for test image 
Step9 : Match common features with the test image 


Step10: Label the character ‘Z’ with the modern Tamil alphabets 


Step 11 : End 
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5.2.2 Implementation and Result Analysis 


In this work, present experimental results of a pattern matching 
algorithm in the context of recognizing characters from ancient Tamil stone 
inscription images. The input images used here were digitally acquired with a 
high-end camera. First of all, SURF features were extracted from the training 
stone inscription images, which predicted 128 of the strongest key points from 
the said images. The same 128 points were extracted for all the characters 
from the stone inscriptions and stored in the database. Once the testing images 
were given, the same SURF features were extracted from the images and 
compared with the database containing the trained image features. The 
drawback of the SURF algorithm is that it runs through the entire database in 
order to recognize a single character. It then compares the character with the 
massive trained dataset, which is a time-consuming process. The results of 
implementing the SURF algorithm are shown in Figure 5.2 The sample input 
image shown in Figure 5.2 (a) is preprocessed and the SURF algorithm 
applied on the image. The algorithm predicts the strongest points, as shown in 
Figure 5.2 (b), and the similarity detection is shown in Figure 5.2 (c). The 
strongest key point detection for each character in the testing image is shown 


in Figure 5.2 (d) to Figure 5.2 (k). 


(a) (b) 
Figure 5.2 (Continued) 
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(e) 


(g) 


Figure 5.2 (Continued) 


a 
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(j) (k) 


Figure 5.2. Implementation result of MSURF-BOG algorithm: a) Input 
image b) Selecting strongest feature points c) Matching 
cluttered points d) & e) Strongest feature points and 
matching the letter ‘o’ f) & g) Strongest feature points and 
matching the letter ‘pa’ h) & i) Strongest feature points and 
matching the letter ‘la’ j) & k) Strongest feature points and 
matching the letter. 

To resolve the drawback of time consumption, the Modified 
SURF-BoG feature algorithm is proposed. The standard Tamil alphabet 
comprises 12 vowels and 18 consonants. However, the ancient 11" century 
characters under consideration had 7 vowels and 17 consonants, making of 24 
characters in all. For the purpose of this research, 25 samples with different 


orientations from each of the 24 characters, totalling 600 characters in all, 


were cropped for training the system. For a single character, 25 variations of 
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Table 5.1 Sample input image of certain characters, the histogram plot, 


and its corresponding accuracy rate 
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samples were taken and the characters sliced into 500 patches each. Each 500- 
patch character was compared against every other character to get a unique, 
finalized similar patch called the BoG. This is stored in the form of a 
histogram called code word creation in the database. Similarly, the process 
was repeated for all 24 characters. Once a testing character is given, it slices 
images into 500 patches each and generates a histogram. Thus, the testing 
histogram searches for the matching histogram in the database. With 
correlation template matching, the system recognizes and labels the 


characters, as shown in Table 5.1. 


The experiments were performed on camera-captured Tamil stone 
inscription image. The recognition rate can be found by R, as given in Eqs. 


(5.5) & (5.6) 


NM=N-SM (5.5) 


R ==" x 100 (5.6) 
where SM is the Similar Match, NM a Non-similar Match, and N the number 
of test samples. The computation results for all 24 characters are shown in 


Table 5.2. It is observed that the proposed MSURF-BoG method achieves a 


higher recognition rate than SURF algorithm for inscription characters. 


Table 5.2 A comparison of the recognition rate for the SURF and 


MSURF-BoG 
SURF Proposed SURF- 
5. No,| Sample test | Modern Tamil BoG 
‘""""| characters character ny Gha ke R ler Ni R 
(%) (%) 
1 | 3 25/15 [10 |60 [25/20 [5 |so 
2) Oy, 25]13 |12 |52 25]21 |4 |s4 
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Table 5.2 (Continued) 
S. No. Sample test | Modern Tamil PERE arr “ 
characters character N|SM|NM R N |SM/NM R 
(%) (%) 
3 | (a) 25/16 |9 |64 |25]18 |7 1/72 
4 a 25/11 |14 |44 |25]19 |6 |76 
5 | iy 61 25/16 |9 64 |25|20 /5 [80 
6 | BBR a) 25|14 111 |56 |25|16 |9 |64 
7 | & 25/13 |12 |52 |25|20 |5 |80 
8 | a & 25/16 9 |64 |25|21 |4 |84 
>| i Trl 25/12 |13 |48 |25|19 |6 |76 
10 | & 25/14 |11 |56 |25|17 /8 [68 
11 & 25/10] 15 | 40 |25| 16] 9 | 64 
12 L 25}16) 9 | 64 |25) 17] 8 | 68 
13 600T 25/14] 11 | 56 [25] 21] 4 | 84 
14 5 25/17] 8 | 68 |25| 18) 7 | 72 
15 ii 25/20] 5 | 80 |25| 20) 5 | 80 
16 u 25/21] 4 | 84 |25| 22] 3 | 88 
17 LO 25/22] 3 | 88 |25| 22] 3 | 88 
18 WwW 25/18] 7 | 72 |25|19| 6 | 76 
19 1) 25|20| 5 | 80 |25| 22] 3 | 88 
20 Sy) 25/21] 4 | 84 |25| 20) 5 | 80 
2] eu 25/19] 6 | 76 |25}21| 4 | 84 
22 Lp 25/15 | 10 | 60 |25| 17] 8 | 68 
23 er 25/17] 8 | 68 |25| 19] 6 | 76 
24 6OT 25/18] 7 | 72 |25|20) 5 | 80 
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Figure 5.3. A comparison of the character recognition rate using the 
pattern matching algorithm 


A SUMMARY 


The SURF and MSURF-BoG algorithms have achieved an overall 
accuracy of 71.33% and 77.5% respectively in terms of character recognition 
Using the pattern matching algorithm, only 25 samples from each character 
needed to be trained. This has helped compute the histogram and identify 
characters. Though the proposed algorithm recognizes characters, the 
drawback is that the computational time is high, given that it searches the 
entire database to recognize every character. The pattern matching algorithm 


misclassifies similar kinds of characters. 
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CHAPTER 6 


FEATURE EXTRACTION AND CLASSIFICATION 
USING THE ARTIFICIAL NEURAL NETWORK 


6.1 INTRODUCTION 


A feature extractor is essential for efficient data representation and 
extracting meaningful features for subsequent processing. The feature 
extraction technique is classified into two: global and local. A Combination of 
these two features is used for recognition. Feature extraction derives values 
that are informative and non-redundant from images. Image classification 
helps extract information classes through supervised and unsupervised 
learning processes. This chapter describes a method for feature extraction, 
classification and recognition of characters through the ANN from stone 


inscription images. 
6.2 FEATURE EXTRACTION 


The local features such as area, moment, entropy, loops and are 
extracted from stone inscription images. The area is the total number of pixels 
in a region, and is calculated by multiplying the product of its height and 
length. The length of a character is computed by subtracting the column 
number of the last pixels from the column number of the first pixel. The 
height of a row is calculated by subtracting the row number of the last pixel. 
Entropy provides standard information on an image in terms of luminance, 


contrast and pixel value. Character shapes are either circular or curved, 
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forming a loop. The filled area within the closed loop is to be calculated. The 
junction, which is the interface between two regions of a character, helps 
discover the slant angle. The extracted feature is given as input to the 
classification process. Classification identifies to which of a set of categories 
a new observation belongs, based on a training set of data containing such 


observations. 
6.2.1 Quantitative features 


Quantitative features refer to global features that have the ability to 
generalize an entire object with a single vector. Object recognition systems 
routinely use global features that describe an entire image. Global Features 
are extracted from whole images by taking into consideration low-frequency 
coefficients, as well as global image features including color, texture and 


shape. 
6.2.2 Qualitative features 


Qualitative features refer to local features that have patterns or 
distinct structures found in images, such as a points, edges, or small patches. 
They are usually associated with an image patch that differs from its 
immediate surroundings in terms of texture, color, or intensity. Local features 
are high-frequency and dependent on the position and orientation of images. 
Table 6.1 below lists the quantitative and qualitative features of a character 


image. 


Table 6.1 A list of quantitative and qualitative features 


Quantitative features Qualitative features 
Area Horizontal Angle 
Moments Vertical Angle 
Filled Area Left Slant 
Eccentricity Right Slant 
Entropy Extent 
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Feature extraction is the process of studying and deriving useful 
information from a filtered input inscription image. The derived information 
can be general features that are evaluated to ease further processing. Feature 
selection is crucial because there could be one or two values that are 
significant in character recognition, especially of a particular character. The 
performance of the recognition system relies greatly on the quality of the 


features extracted. 


6.2.3 Extraction of global and geometric features 


Global and local features are two types of features extracted from 
inscription images. Global features consider characters as images, and are 
extracted from the texture of the image using Gabor filters and cooccurrence 
matrices. Geometric features are external features, and the proposed system 


lays great emphasis on them. 


Gr $ Universe of Zoning Assign starters, intersection 
discourse -——>) and minor starters 


Calculating eccentricity Finding direction Extraction of line 

extent and orientation of an element segments 

Calculate Calculating : 
—P> /—_> Feature vector of an image 


entropy Zernike moments 


Figure 6.1 Overall architecture diagram for feature extraction by using 
local and geometric feature 
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6.2.4 The Universe of Discourse 


The universe of discourse is defined as the shortest matrix that fits 
the entire character skeleton. The universe of discourse is selected for feature 
extraction, based on the positions of different line segments in the character 
image. After the selection of the universe of discourse, the image is divided 
into equal-sized matrices, and feature extraction is done on individual 
matrices. Each matrix, called a zone, offers additional information about the 
unique details of a character skeleton. To extract different features in a 
particular zone, the entire skeleton image in the particular zone must be 
traversed. Each zone extracts features and the process continues until features 
from all the zones in the skeleton are extracted. Traversal stops with a visit to 
every pixel. Every zone creates a feature vector corresponding to it. The 
different features taken include the total number of lines (horizontal, vertical, 
right diagonals, and the left diagonals), the length of lines (horizontal, 


vertical, the right diagonals, and left diagonals), and the area of the skeleton. 
6.2.5 Zoning 


The frame containing the character is divided into several 
overlapping or non-overlapping zones and the densities of object pixels in 
each zone are calculated. Density is calculated by finding the number of 


object pixels in each zone and dividing it by the total number of pixels. 
Algorithm: 


Input: Preprocessed inscription image 
Output: Feature vector of an image 
Begin 


Assign block_id=1 


eoaie ANNA UNIVERSITY, CHENNAI - 600 025 (4s) 


for each row i from I to 4 
read 10*10pixels in matrix format 
assign block[block_id]to read matrix 
increment block_id=block_id+1 
for each block_id from I to 6 
To read the block, read block[block_id] 
count=number of black pixels in the block 
Obtain signature array[block_id]=count 
end for 
End 


Result — signature array of the image 
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6.3 CLASSIFICATION BY ARTIFICIAL NEURAL NETWORK 


In this section, the ANN is trained using the backpropagation 


algorithm. After converting the Tamil inscriptions into, the matrices are fed to 


the ANN as input. After the feedforward algorithm lays the groundwork for 


the working of the neural network, the Back Propagation (BP) algorithm 


undertakes training, calculates errors, and modifies weights. The BP 


algorithm starts with computing the output layer, which is the only one where 


the desired output is available. The error rate in the output layer is calculated, 


based on the difference between the desired and actual output. In this 


research, the result of the BP-ANN is a matrix of 1x64. The output obtained 


from the BP-ANN can be used to obtain characters. 
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Hidden layer 


Input layer Output layer 


Charac 
ter 
Recog 


Features 
for 
character 


ima ge nition 


Figure 6.2 Neural Network architecture 


In the neural network, the leftmost nodes assign initial weights 
randomly and train continuously. Weights are added to and stored in the 
centermost nodes. Several nodes are generated and fall between O and 1, 
based on the threshold generated. The final weights are compared with the 
input, and the process is reversed to minimize _ errors. Finally, based on the 
weights, the vectors are compared with the test data and the corresponding 


character displayed if the weight matches. 


The same process is repeated when the single character that is cut 
from the input image is trained to find the features of the corresponding test 
data. The neural network reduces manual training and recognizes the output 
easily. The basic requirement of a neural network is that it needs the data 
necessary for accurate training. The weight differs in line with the different 
input fed. The artificial neural network is used to classify inscription scripts. 
The ANN is trained with the inscriptions feature vector and gives good 
classification accuracy. Consequently, a more accurate recognition of ancient 


Tamil characters from various a slew of stone inscriptions across centuries is 
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obtained. The captured stone inscription features are trained and tested, using 


the ANN, to recognize Tamil inscription scripts. 


Feature vector of the character 
Import neural network tool 


Import training data and test 


Start training and determine 


and revise weights 


Select parameters values, 


initialize weights 


Define network architecture 
Training neural network 


Classification of character 


image 
| 


Recognition of character image 


Figure 6.3 The flow of the Classification module 
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PSEUDO CODE: 
Step 1: Initialize the connection weights (weights are taken randomly) 


Step 2: Propagation entries entered the EFii are presented to the input 


layer: 

XXii = E; (5.7) 
The spread to the hidden layer is made using the following formula: 

Y = f ier X7 Wii + Xo) (5.8) 
Then from the hidden layer to the output layer, we adopt: 

Zy = f int Yi Wey + Yo) (5.9) 
where X, and Y9 are scalars 

(x) is the activation function: 

(x) = 1/ l+e—; (5.10) 
Step 3: Back propagation of error at the output layer, the error between the 

desired output S; and Z;, output is calculated by 

Ex = Ze (Ze — 1) (Sz - Ze) (5.11) 


The error calculation is propagated on the hidden layer using the 


following formula: 
Y= fp — Y) Dhar Way Ex (5.12) 


Step 4: Fixed connection weights the connection weights between the input 


layer and hidden layer is corrected by: 
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DY, = nF; (5.14) 


Then, the change of connections between the input layer and the 


output layer by: 


DWy; = nY,Ey (5.15) 


N is a parameter to be determined empirically. 


Step 5: Loop to step 2 until a stop to define criterion (error threshold, the 


number of iterations). 


Features are extracted from the skeleton of a character. Figure 6.4 
below represents the feature vector of an inscription character image. Feature 
vectors are imported into the network as inputs, and the network trained as 


shown in Figure 6.5. 


0.013 0.2525 0.1975 


0.0142 0.1244 0.2326 0.1131 0.0185 


0.2058 0.0107 0.2224 0.1981 0.0166 


0.0124 0.2519 0.197 0.0087 1 


0.1077 0.0157 0.1746 0.1191 0.0133 


0.1296 0.0202 0.2551 0.1473 0.0074 


0.1066 0.015 0.1715 0.2108 0.018 


0.1358 0.0162 0.25 0.1493 0.0071 


0.2142 0.019 0.2608 0.1501 0.0068 


0.1242 0.0183 0.3445 0.1189 


0.1007 0.0192 0.2271 0.1495 


Figure 6.4 Stored feature vectors into a table 
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Figure 6.6 Neural network training regression 
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6.3.1 Evaluation Metrics 


The proposed system evaluates the performance of the 
classification model by calculating standard performance measures including 
precision, recall, and f-measure. Here, the results of 24 classified characters 
are shown in Table 6.2. The actual input is the column matrix with class 
labels predicted by features, and the predicted value is the column matrix with 


class labels predicted by the classification model. 


True positive 


Precision = Sea a See (5.17) 
True positive+False positive 

True positive 

Recall = ————_——————_ (5.18) 

True positive+False negative 

2*precision*Recall 

f — measure = ————— (5.19) 
precision+Recall 


Table 6.2 Performance metrics of character recognition using the ANN 


Ancient Tamil characters! Modern Tamil |ANN Classification results 
from stone characters Precision |Recall| Accuracy 


Vowels 


A 89.19 | 82.50} 78.01 


75.36 | 81.25 | 75.36 


D159. - | 85.53 | Slo2 


80.65 | 89.29} 77.50 


81.25 | 92.86} 80.30 


80.21 |93.24} 82.14 


72.50 | 74.36} 87.88 


2) 98.09 | f° |Co | bP |Y 
9/8) AlP | Die 
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Table 6.2 (Continued) 
Ancient Tamil characters| Modern Tamil [ANN Classification results 
from stone characters Precision|Recall| Accuracy 
Consonants 

a & 92.86 |76.47| 75.12 
fa [hl 70.50 | 86.54] 69.61 
cf & 72.22 |86.67) 72.00 
od, © 87.50 |81.16| 76.40 
e. L 68.75 | 84.62) 65.00 
ort 600 92.86 |75.12| 76.47 
hy 5 81.34 | 93.43] 79.37 
%' 1 80.37 | 87.71] 75.16 
ma) u 80.29 | 85.71| 73.68 
i) LO 79.87 | 85.36) 74.50 
val WwW) 84.38 | 73.67| 71.00 
bes 7 79.87 |85.36| 74.50 
OU 60 72.22 | 86.67) 72.00 
Ga eu 92.31 |79.00| 82.14 
— Lp 85.32 |80.13| 79.21 
al en 76.92 |96.77| 77.78 
on oO 71.83 | 85.02] 77.80 


t 
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Table 6.2 above gives the precision, recall and accuracy results 
obtained in character recognition using the ANN. The overall accuracy rate 


obtained is 80%. 
6.4 SUMMARY 


This chapter has attempted an implementation of the ANN, given 
the lack of a standard method or algorithm to recognize characters from stone 
inscription images. Several classification models had been tried earlier on 
ancient inscription characters to predict them correctly. Chapter 5 tested the 
pattern matching algorithm. Here, in Chapter 6, the next approach proposed 
using the ANN has been tested, wherein features initially extracted from 
characters are input to the neural network. The ANN predicts characters better 
than the pattern matching algorithm, as a larger number of features are 
extracted from each character. However, despite the massive volume of data 
to be trained, and the fact that similar kinds of characters have had their 
features wrongly extracted, a recognition rate accuracy of 80% has been 


achieved. 
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CHAPTER 7 


CLASSIFICATION AND TRANSLITERATION USING THE 
CONVOLUTIONAL NEURAL NETWORK 


7.1 INTRODUCTION 


This chapter deals with a proposed classification model, the 
Convolutional Neural Network (CNN), for recognizing ancient characters in 
stone inscription images. Stone inscription characters are naturally 
unconstrained. A deep learning model is developed that classifies ancient 
characters using a multilayer feedforward network called the CNN. The CNN 
is robust at extracting features from any kind of image and is a most 
representative supervised learning model. Classifying characters is a 
challenge task involves taking inputs and producing output from a class that 


describes the character in question. 


7.2 AN OVERVIEW ON THE DEEP CONVOLUTIONAL 
NEURAL NETWORK 


Segmented character images with an array of pixels are fed into the 
neural network. The first CNN process is convolution by means of a filter 
mask with a kernel template window. This is because the filter helps find the 
edge features by moving across the whole input image and trying every 
possible match. Consequently, an image becomes a stack of a filtered image 
after convolution. Rectified Linear Units (ReLu) are applied to normalize the 


image, i.e. all negative values are replaced by zeros. Next, the image moves to 
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the second layer, called pooling, which helps obtain fine details from the pixel 
values and shrink the image stack. These layers are repeated several times so 
the system gets trained well. Finally, the third fully-connected layer is formed 
by getting a finalized vote. From the vote, the character is classified and 


recognized. The CNN architecture is shown in Figure 7.1 


~1L SO YOU | = 
— — y 


aa aa connected 


Convolution Rectified Pooling 
layer Linear units layer 


layer 


| 


Recognized 
character 
based on 

votes 


Figure 7.1 The Deep Convolutional Neural Network 
7.3 LEARNING ARCHITECTURE 


Deep learning models are among the most powerful classifiers. 
They excel in image recognition, because a deep learning model like the CNN 
easily identifies image structure. The advantage of a deep learning model lies 
in its automatic feature extraction through finding local connectivity among 
pixels, and its weight-sharing strategy with similar texture and brightness 
properties. Hence, developing a deep learning model would be the _ best 
choice for recognizing characters from stone inscription images. Such a 
model also helps handle huge volumes of data to achieve a high recognition 
rate. Recognizing characters etched on stone is difficult on account of the 


writing style in vogue at the time. 
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The proposed framework comprises three parts, as follows: (i) 
Samples are generated from training large number of characters, (11) A CNN 
network model is designed, based on the properties of the characters, and (111) 
Voting is employed for the final recognition result. The proposed CNN 


training architecture is shown in Figure 7.2. 
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Figure 7.2. Training CNN architecture for recognizing inscription 
characters 

In the proposed CNN architectural framework, the input layer takes 

raw pixel values from an image with dimensions of 150 X 150 in terms of its 

width and height. The next layer is the convolutional layer that contains 


neurons connected to the input. Here, there are four convolutional layers with 
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the Rectified Linear units (ReLu) activation function for normalization. First, 
the three convolutional layers use 32 filters with 3X 3 kernels and max 
pooling with 2 X 2 to perform a down sampling operation, while the fourth 
layer alone uses 64 filters. Finally, ReLu activation and 64 nodes are used on 
the fully-connected layers. This layer computes the class score, which is 
connected to SoftMax, to decide the output and distribute probabilities to all 
the nodes, based on the voting. Finally, the output layer is mapped with 
modern characters, using Unicode. Once the characters are recognized, they 


are transliterated and displayed in modern Tamil characters. 
7.3.1 Data collection 


Stone inscription script images totaling 250 in all were digitally 
acquired from the Tanjore Brihadeshwara Temple using DSLR camera. 
Contemporary Tamil has an alphabet that comprises 12 vowels, 18 
consonants, | special character and 12 x 18 compound characters, totaling 
216 characters, all of which together form 247 combinations. During the 11" 
century, however, only 7 vowels and 17 consonants were used to form a total 
143 combinations of characters. The images were divided into four groups, 
based on the lines and characters on them, as shown in Table 7.1. Of the total 
22000 characters handled, there were 8210 that had deteriorated greatly, 
leading to misclassification. Consonants were present in large numbers, while 
vowels and a combination of the two were relatively small in terms of the 


numbers making up the 22000 characters in all. 
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Table 7.1 Stone inscription image dataset count 


Number of 
of Number of Characters 
S. No.| Script size | characters (row x : nee 
Inscriptions count 
column) 

1 Large 26 x7 125 22150 
2 Medium 12x7 55 4620 
3 Small 8x7 40 2240 
4 Too small [5x4 30 600 

Total 250 30210 
yn: Computational considerations 


During the computation, the three major sources of memory that 


needed to be considered are listed below: 


fe 


The activation function present in every layer which transform 
volumes of information through a differentiable function. The 


ReLu activation function applies max (0, x). 


Categorical cross-entropy, which measures loss in_ the 
performance of a classification model, predicts the probability 


of the loss value. 


The RMSProp optimizer is a kind of gradient descent algorithm 


that helps increase learning rates. 


Implementation setup and algorithm workflow 


The python library is used to train the deep learning network to 


recognize inscription characters. The Graphics Processing Unit (GPU) system 


of the NVIDIA GeForce GTX 1080 series, with an 8 GB RAM and an Intel 
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Core 117 processor with a 32 GB RAM is used. The character recognition 
model is built and trained using Keras, which runs on TensorFlow. Algorithm 
steps: 

Training 


Step 1: Import image data generator, sequential model, layers as conv2D, 
Maxpooling 2D, Activation, Dropout, Flatten, Dense 


Step 2: Resize the character image as 150 X 150 
Step 3: Give the training samples as train data 


Step 4: — Define the hyper parameter with number of epochs=200 and batch 
size=16 


Step 5: Define CNN architecture with all layers mentioned in step1 
Con2D (32, (3, 3)) 
ReLu activation 
Maxpooling (2,2) 

Step 6: Add flatten, dense (64), Dropout (0.5), activation (ReLu, SoftMax) 


Step 7: Compile the model using categorical cross entropy loss function 
and RMSProp as optimizer. 


Step 8: Fit the model with training data 
Step 9: Monitor training accuracy and training loss 


Step 10: Save the trained model (Architecture and weights) 


Prediction 
Step 1: Load the trained model 


Step 2: Load the weights 
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Step 3: Compile the model 
Step 4: Resize the test image according to the model 
Step 5: Predict the image using compiled model 
classes = model.predict(img) 
Step 6: | Get the class name of the output 
class_number = np.argmax(classes) 
Step 7: Transliterate the character based on class name 
char_order = int(image.split('.')[1]) 
total[char_order] = transliteratecount.transliterate (class_number) 
Transliteration 
Step 1: Assign Unicode’s of Tamil character to the class names 
def transliterate(class_number) 
labels = {} 
labels=labels[class-number ] 
Step 2: Map them based on the output of the CNN prediction 
def get_unicode(label) 
unicode_dict = {} 
t = unicode_dict[label] 
Step 3: Repeat step 2 for each character in the line 


Step 4: Concatenate all the character outputs to form a sentence 
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7.4 IMPLEMENTATION RESULTS OF CNN 


It is plain from the literature reviewed that no transliteration work 
has been done on 11" century inscription images. These inscriptions are 
stored in a document format, called estampages, in Mysore in the state of 
Karnataka. The Department of Archaeology, Government of Tamil Nadu, has 
averred that estampaging and manual transliteration are laborious and time- 
consuming processes. There is, therefore, every reason to develop an 
automated system that will help preserve inscriptions in a digital format. 
Recognizing characters from stone inscription image is a challenge, given that 
the characters were inscribed centuries ago and have, in addition, suffered the 
ravages of time and deterioration brought on by natural hazards. This work is 
an initiative and the first of its kind to develop a powerful model to recognize 
inscription characters using a deep learning model termed the CNN. The 
model learns characters through variations, and trains the system to ascertain 


what characters they are. 


In this work, the CNN model is trained to recognize ancient 
characters. Each character, with n number of samples, is given as input to 
train the CNN. There are more consonants than vowels and combinational 
characters for training the CNN model. When the training data is small, it 
misclassifies combinational characters as compound characters. The 
computational results in terms of precision, recall and f-score are shown in 


Table 7.2. 
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Ancient Tamil Modern Tamil CNN 
characters from stone characters Precision) Recall |Accuracy 
a) rN 90.91 |81.08 85.71 
x 4 Oh, 77.78  |93.33  |75.00 
ms a) 94.12  |88.89 — |85.00 
e De 94.44 91.40 — |87.38 
a, 61 94.74 |96.77 |92.00 

) 69 94.12  |90.91 — |87.00 
G) @ 94.44 |92.39 — /88.00 
* & 94.25 91.11 — |87.00 
pa [hl 93.90 |88.51 [85.04 
cf & 90.59 88.74 —|85.61 
od, © 85.09 88.64 80.00 
he L 88.51 |91.67 83.57 
o~y 6OUT 87.24 |88.24 80.20 
dy 5 91.86 90.80  |85.04 
%,' ib) 91.67 [88.51 —|83.69 
us u 91.57 90.48 — |86.05 
ES) LO 94.19  |89.01 —|85.64 
LL) WwW 94.32 91.21 = (87.12 
sS y 94.44 95.51 —|91.56 
ON 6v 93.49 |95.51 —|89.98 
va ou 93.58  |96.26 |90.64 
=“— Lp 92.31 93.33 |87.21 
Oo en 89.45 |91.91 — |87.82 
on 60T 91.41 92.50 —|91.85 


t 
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Using CNN architecture, ancient characters are recognized by 


training the system with a huge volume of characters. For example, since the 


character a hs is found to have lot of volume, the system is trained with 500 
samples, but not all characters have the same volume of samples. Each 
character is trained with the count of characters available. From the results, it 
is inferred that certain compound characters and vowels have very few 
samples, say no more than around 50 to 100. Such characters with few 
samples reduce the recognition rate accuracy. Otherwise, compared to the 
pattern matching algorithm and ANN from the previous chapter, the CNN 
gives a higher accuracy rate of 90%. The overall accuracy rate from 


comparing the three machine learning models is shown in Figure 7.3 
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Figure 7.3. A comparison of the machine learning models developed for 
recognizing stone inscription character 

A comparison of the three machine learning models is made and 

shown in Figure 6.7. It depicts the three models, specifically developed for 

the purpose of recognizing stone inscription characters, and _ their 

corresponding overall accuracy. Figure 6.7 makes it plain that the CNN has 


the highest recognition rate in classifying stone inscription characters. 
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In a convolutional neural network, the training loss is computed at 
each step and backpropagated to the input cycle. This propagated value helps 
the next iteration improve accuracy, and the model becomes correspondingly 
more precise at each step. The accuracy of the model increases at every n‘” 
step and, finally, the accuracy at the end of training is 90% and the loss 
reduced to below 0.5%. The corresponding plot is shown in Figures 7.4 and 
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Figure 7.5 Plot on training loss 
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qo TRANSLITERATION RESULTS 


To transliterate ancient characters, a dictionary is created by 
mapping ancient characters with contemporary Tamil characters. Finally, the 
system is trained to print the contemporary Tamil character by matching them 
with Unicode. Transliteration is done for the sample segmented image shown 
in Figure 7.7 and 7.8. Figure 7.6 shows a 11" century character evolution 


chart standardized by the Department of Archaeology. 


Figure 7.6 A chart of ancient character received from the Department 
of Archaeology 
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Ancient characters in the inscription images under scrutiny, 
predicted by a team of epigraphists, serve as a reference for finding their 
equivalent contemporary Tamil characters, and so help in the transliteration 
process. Table 7.3 and 7.4 represent ancient 11" century characters, their 
corresponding contemporary Tamil characters, and Unicode for transliteration 


into modern Tamil characters. 


Table 7.3. Ancient vowels and their equivalents in contemporary Tamil 
with Unicode 


S. No. encsent i ecamar oye) Unicode | Modern Tamil vowels 
characters 
1 =}. OB85 cy 
2 Ay 0B86 Oy, 
3 ns, 0B87 ®) 
4 2. OB89 oe 
5 ll OB8E 6T 
6 3} 0B90 a) 
a GD 0B92 @ 
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Table 7.4 Ancient consonants and their equivalent contemporary 
Tamil with Unicode 


S. No. Ancient 11™ century Unicode Modern Tamil 
consonant characters Consonants 
1 a 0B95 & 
2 LY 0B99 Thi 
3 cf OB9A & 
4 eo OB9E © 
5 d OB9F Le 
6 si OBA3 6001 
7 3, OBA4 5 
8 % OBAS B 
9 ec) OBAA u 
10 a) OBAE LO 
11 L223 OBAF WI 
12 S OBBO y 
3 =| ONS OBB2 60 
4 |Qy OBB5 ou 
15 ce OBB4 Lp 
16 «|| Pe OBCD en 
17 | CU} OBA oor 
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File Edit View Search Terminal Help 
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Figure 7.7 Image 1: Transliterated characters 
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NIT RAKIR 
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Figure 7.8 Image 2: Transliterated characters 
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7.6 SUMMARY 


In this chapter, a final end-to-end solution has been derived by 
using a deep learning model that automatically recognizes and transliterates 
characters in inscription images. A character in the form of an image is input 
to the CNN model, wherein the CNN classifier extracts features during 
training. From the trained model, the CNN easily recognizes characters 
given during testing. This is the first CNN model designed to recognize stone 
inscription characters, and has achieved rate of 90% with a loss under 10%. 
Thus, the CNN model helps train large volumes of data, clearly resulting in a 
high recognition rate . In this chapter, transliteration is done by mapping 
ancient characters to Unicode and printing them in contemporary Tamil. This 
work developed a new system to transliterate charcaters automatically, which 
helps reduce the manual labour involved in deciphering data stone 


inscriptions. 
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CHAPTER 8 


CONCLUSION AND FUTURE WORK 


8.1 CONCLUSION 


Image processing and pattern recognition techniques have made 
great strides in recent times. As far as stone inscriptions are concerned, 
however, the method of acquisition used by the Department of Archeology is 
traditional, laborious and time- consuming. The manual work of translation is 
done by epigraphers, and these manually-acquired inscriptions are preserved 
only in a degradable paper format. A huge mass of inscriptions is available, as 
yet unnoticed by the Department of Archeology. A digital acquisition of stone 
inscriptions will resolve the issue of the manual work still in progress, greatly 
reducing the time taken for the process, from as much as a few years to as 
little as a month. Such a digital recognition system is essential on account of 
the rapid deterioration and loss of content from stone inscription images. This 
research has focused on state-of-the-art approaches in digital acquisition, and 
on the automated transliteration of characters found in stone inscription 


images. 


A brief study of the literature on the subject makes it clear that 
work on character recognition has only focused on clear document images 
and scanned handwritten images. There is very little research on recognizing 
characters from real- time images. Intensive research in the field of OCR is 
chiefly driven by the challenge it presents in simulating human reading, and 


also because it provides efficient applications such as the automatic 
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processing of bulk papers, transferring data to machines, and web interfaces 
for paper documents. To have machines replicate human functions and 


perform common tasks like reading is a dream come true. 


This research has dealt with the complexities involved in separating 
foreground characters from the background. There are difficulties involved in 
dealing with input images, especially in terms of complex variations in 
character shapes and deformities brought on by deterioration. Handling 
document images is far easier than handling stone inscription images, given 
that the latter revel a lack of continuity in inscription character. Hence, an 
efficient binarization technique has been proposed to extract characters from 


stone inscriptions. 


The next module of this research has resolved the issue of 
segmenting complete scripts into individual characters. the complete scripts 
into characters. To this end, an adaptive thresholding technique has been 
proposed, based on a histogram plot, that helps segment lines and characters. 
In the subsequent stage, machine learning models have been developed to 
identify unique character features that help recognize and classify characters. 
Both the pattern matching algorithm and ANN produced a lower recognition 
rate and a higher _misclassification rate. Consequently, the deep learning 
model, CNN, has been proposed to train a large number of characters and 


recognize inscription characters by achieving a higher accuracy rate. 


The results show that the techniques used in the study have 
demonstrated a good accuracy rate. The work will help epigraphists and 
researchers access and interpret the information and knowledge retrieved 
from ancient stone inscription images. Further, it will keep the general public 
interested in the contents of inscriptions, now accessible in the contents of 


inscriptions, now accessible in a user-friendly digital format. In addition, this 
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system will be of use to the Department of Archeology as it greatly reduces 


the time taken for manual translation. 
8.2 FUTURE WORK 


From this study and the results obtained, future directions can 


include the following: 


e Since the proposed enhancement technique works for clear 
inscription images, future work can focus on reconstructing 


deteriorated characters. 


e The proposed work has concentrated only on 11" century 
inscriptions, on account of the huge volume of characters 
handled, stemming from the proliferation of work done during 
the era. In the future, inscriptions from other centuries can be 


handled as well. 


e The accurate prediction of compound characters and vowels can 
be improved by training the deep learning model on a much 


larger character dataset. 


e This system transliterates characters as such. Future work can 
include the grouping of characters into word so as to form 


meaningful sentences. 
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ANNEXURE I 
EXPERT REPORT —- DATASET ACQUISITION 


IT am Mr.K.T.Gandhirajan from Tamil Virtual Academy gave 
guidance for Ms.K.Durga Devi, Research scholar in Computer Science and 
Engineering department, College of Engineering Guindy, Anna University, 
Chennai to collect dataset from Tanjore brihadeeswara temple — 11" century 
inscriptions. She approached me to know the details about inscription’s 
location and volume present. From archaeology side the inscriptions were 
acquired through manual method of estampaging and manual transliteration. 
Her proposal is to develop inscription transliteration system. Since her proposal 


is new, she has given guidance to acquire the data. 


My 
Tamil Virtual Academy 


Mr.K.T.Gandhirajan 
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ANNEXURE II 
EXPERT REPORT —- VALIDATION 


Ms.K.Durga Devi, Research scholar in Computer Science and 
Engineering department, College of Engineering Guindy, Anna University, 
Chennai worked on character recognition from stone inscription image taken 
from Tanjore brihadeeswara temple — 11" century inscriptions. She developed 
the system for automatic inscription transliteration system to convert the 
ancient characters to contemporary Tamil characters. The results achieved by 
her through the developed system is validated manually by the epigraphist and 


found to be that, she achieved 90% of result in transliteration. 


}3]!9 


Tamil Virtual Academy 
Dy~ Gi, Sage kolo 
ReAourte P&ASen 


Sh. 
Sa ANNA UNIVERSITY, CHENNAI - 600 025 


149 


150 


ANNEXURE III 


SMEG 97s 
AsrevoSluisd Hon 
DLCOGHTULNT D|syjousvertD, 
Sup ueriésl uorirsio, 
SiMlips srromev, srUpLOLy,, 
Glscironcor—8. 
fh. oT 660T.(9}4/2557/2017, mireit_07.06.2017 


Qurgét Agreed sim — HG. K. srisrsoll, gy iiowremt, oswsrevorrr 
Ldsmodsps, Gesronor — sip sel Gest orga 
FOO Hpi} — Hiolipssre sevGlos_OiiLtg CourssorHs60 — 
Hioomudlesr epoid HVC OW cigessir ourprasesl AILS omooresetr 
AsMaIs He — Er runrs. 


uri SG. K. gitsnCsoll, gyliouromi, o\siorovorr Liddesmovd spd, 
Oecironsr Sips miei 12.04.2017. 


LTTDEUMe sTEMHNLD GHB SyeSHFHD Cor OSGesrsvorsHorriip, rae Creo Olio 
Agmiturs Ggsfley Gatiisysror sensvtny “DEVELOPING A SMART SYSTEM FOR 
DIGITIZATION AND DECIPHERMENT OF TAMIL STONE INSCRIPTIONS” G@igm_tturss 
SHS Srapd Hubsonorsahesr Sgtiismuile sedvClou Gi Ligseir FMB OHH G OULU (ple 04 
Gain Geren. 


AL ensor soir 


1) u@smpt sioapow Project-@ solr Gam) THHSGH, sTuGZsGid O)soor [hol omGoT 
ALP HIE Fri. 


2) oiorfls FHuirs QaAsvGlou Qesomor wwii HHS SoS}. 
3) Project-sir eypeuld gout Garis; Lowflescir ohm enssuumodor siren Glssmyfled gy L1 aflouyrareseit 
om GuerGurcsr wood Hard Disk-« ugley Geis) sioomde, si0iililses 


Couexor (HL. 


4) Project siewbistiuLoysiron pret Loa PLL SeLLILoyerror [EIreiT oA oMouytssoner 
HONSE QMAmsurs sors ui GoursoorGHo. 


5) sg! Project yFSasSHled HomMsG Hoorn Qe Malssev Couowor(i0. 


6) udevant, LiTéinguit Lom Cempt sro sevClon Oli tiger 10 sraiorsodonsescir 
Lig LILIg ITs OILPTELILIGILD. 


BE 
Sea ANNA UNIVERSITY, CHENNAI - 600 025 


151 


= = 


srost on, Guogbesmrgopya Aub senor ens UiO Project Hiuriscwueyeiror pier Lomo 
QULCO_ASUUL oyster spre oy, lw Glesnieesi FIOMSEG OMsnsurs Csfalseorm 
CEL OSG rsirorliuHAmMs. 


6ptb/— 
GS revolucd Hiomsoor Hovoresrrooofliurerii(Gl.in) 


[a.goSwofleinsy! §— En. Dorn Copan) 
pend ouserteat gir 


x \) 
Gumi 
K. Durga a 
Research Scholar, 
Department of Computer Science & Engineering, 
CEG Campus, 


Anna University, 
Chennai-600 025. 


[560 

Dr. P. Uma Maheswari, 

Associate Professor, 

Department of Computer Science and Engineering, 
CEG Campus, 

Anna University, 

Chennai-600 025. 


a ANNA UNIVERSITY, CHENNAI - 600 025 


LO, 


152 


REFERENCES 


Aggarwal, Ashutosh, Karamjeet Singh & Kamalpreet Singh 2015, 
‘Use of gradient technique for extracting features from handwritten 
gurmukhi characters and numerals’, Procedia Computer Science, 
vol. 46, pp. 1716-1723. 


AL-Farjat, AH 2012, ‘Automatic Transliteration Among indic scripts 
using code mapping formula’. European Scientific Journal, ESJ, vol. 8, 
no. 11. 


Alginahi, Y 2010, ‘Preprocessing techniques in character recognition’. 
In: Character recognition. InTech. 


Aouadi, N & Kacem, A 2017, ‘A proposal for touching component 
segmentation in Arabic manuscripts’. Pattern Analysis and 
Applications, vol. 20, no. 4, pp. 1005-1027. 


Arif Mohammed, M, Nasien, D, Hassen, H & Haron, H 2015, ‘A 
review on Feature Extraction and Feature Selection for Handwritten 
Character Recognition’, International Journal of Advanced Computer 
Science and Applications, vol. 6, no. 2, pp. 204-207. 


Bataineh, B, Abdullah, SNHS & Omar, K 2017, ‘Adaptive binarization 
method for degraded document images based on surface contrast 
variation’. Pattern Anal Appl. 20.3, pp. 639-652. 


Basu, JK, Bhattacharyya, D & Kim, TH 2010, ‘Use of artificial neural 
network in pattern recognition’. International journal of software 
engineering and its applications, vol. 4, no. 2. 


Belongie, S, Malik, J & Puzicha, J 2002, ‘Shape matching and object 
recognition using shape contexts’. California Univ San Diego La Jolla 
Dept. of Computer Science and Engineering. 


Bernsen, J 1986, ‘Dynamic Thresholding of Grey level images’. In: 
Proceedings of the eighth international conference on pattern 
recognition, vol 2, no. 2, pp. 1251. 


Beulah, PM & Sahana, R 2015, ‘Efficient modelling technique for 
classification and transliteration of ancient stone inscription’. ARPN 
Journal of Engineering and Applied Sciences, vol. 10, no. 7, 
pp. 2897- 2902. 


fea ANNA UNIVERSITY, CHENNAI - 600 025 (4S) 


11. 


D2, 


1, 


14. 


LS; 


6, 


17. 


18. 


. 


20. 


153 


Bharath, A & Sriganesh Madhvanath 2012, ‘HMM-based lexicon- 
driven and lexicon-free word recognition for online handwritten Indic 
scripts’. IEEE transactions on pattern analysis and machine 
intelligence 34.4, pp. 670-682. 


Bhuvaneswari, G & Subbiah Bharathi, V 2015, ‘An efficient positional 
algorithm for recognition of ancient stone inscription characters’. In 
2015 seventh international conference on advanced computing 
(ICoAC). IEEE. 


Chacko, AMM & Dhanya, PM 2015, ‘Multiple classifier system for 
offline malayalam character recognition’. Procedia Computer 
Science, vol. 46, pp. 86-92. 


ChakiN, Shaikh, SH & Saeed, K 2014, ‘A comprehensive survey on 
image binarization techniques’, In: Chaki N (ed) Exploring image 
binarization techniques. Springer, New Delhi, pp. 5-15. 


Chaowicharat, Ekawat, Kanlaya Naruedomkul & Nick Cercone 2016, 
‘Direction histogram: novel discriminative global feature for Thai 
offline handwritten OCR’. Pattern Analysis and Applications 19.4 , 
pp. 1069-1080. 


Chandrakala, HT & Thippeswamy, G 2017, ‘Epigraphic Document 
Image Enhancement Using Retinex Method’. In International 
Symposium on Signal Processing and Intelligent Recognition 
Systems (pp. 178-184). Springer, Cham. 


Chaudhuri, A, Mandaviya, K, Badelia, P & Ghosh, SK 2017, ‘Optical 
character recognition systems for Hindi language’. In Optical 
Character Recognition Systems for Different Languages with Soft 
Computing. Springer, Cham, pp. 193-216. 


Cheung, KW, Yeung, DY & Chin, RT 1998, ‘A Bayesian framework 
for deformable pattern recognition with application to handwritten 
character recognition’. IEEE Transactions on Pattern Analysis and 
Machine Intelligence, vol. 20, no. 12, pp. 1382-1388. 


Choudhary, A, Rishi, R & Ahlawat, S 2013, ‘Off-line handwritten 
character recognition using features extracted from  binarization 
technique’. Aasri Procedia, vol. 4, pp. 306-312. 


Das, Sugata, Sekhar Mandal & Amit Kumar Das 2015, ‘Binarization 


of stone inscripted documents’, Computer Graphics, Vision and 
Information Security (CGVIS), IEEE. 


asta ANNA UNIVERSITY, CHENNAI - 600 025 (4S) 


21. 


22, 


23. 


24. 


25; 


26. 


2. 


28. 


29. 


30. 


an 


154 


Das, S & Banerjee, S 2015, ‘An algorithm for Japanese character 
recognition’. International Journal of Image, Graphics and Signal 
Processing (IJIGSP), vol. 7, no. 1, pp. 9-15. 


Dave, N 2015, ‘Segmentation methods for hand written character 
recognition’. International journal of signal processing, image 
processing and pattern recognition, vol. 8, no. 4, pp. 155-164. 


Del Bimbo, Alberto, Stefania Santini & Jose Sanz 1994, ‘OCR from 
poor quality images by deformation of elastic templates’. Pattern 
Recognition, 1994. Vol. 2- Conference B: Computer Vision and Image 
Processing., Proceedings of the 12th IAPR International. Conference 
on IEEE, vol. 2. 


Devi, H 2006, ‘Thresholding: A Pixel-Level image processing 
methodology preprocessing technique for an OCR system for the 
Brahmi script’. Ancient Asia, 1. 


Elakkiya, V, Muthumani, I & Jegajothi, M 2017, ‘Tamil text 
recognition using KNN classifier’. Advances in Natural and Applied 
Sciences, vol. 11, no. 7, pp. 41-46. 


Elgammal, AM & Ismail, MA 2001, ‘A graph-based segmentation and 
feature extraction framework for Arabic text recognition’. 
In Proceedings of Sixth International Conference on Document 
Analysis and Recognition, IEEE, pp. 622-626. 


Fischer, A, Keller, A, Frinken, V & Bunke, H 2012, ‘Lexicon-free 
handwritten word spotting using character HMMs’. Pattern 
Recognition Letters, vol. 33, no. 7, pp. 934-942. 


Fujisawa, H & Liu, CL 2003, August. Directional pattern matching for 
character recognition revisited. 


Garg, NK, Kaur, L & Jindal, MK 2010, ‘Segmentation of handwritten 
Hindi text’. International Journal of Computer Applications, vol. 1, 
no. 4, pp. 22-26. 


Gatos, B, Pratikakis, I & Perantonis, SJ 2006, “Adaptive degraded 
document image binarization’. Pattern Recognit, vol. 39, no. 3, pp. 


SL7-327. 


http://www.tnarch.gov.in/epi.htm 


eoaia ANNA UNIVERSITY, CHENNAI - 600 025 (4S) 


a2 


33) 


34. 


33: 


36. 


a18 


38. 


Be 


40. 


155 


Janani, G, Vishalini, V & Kumar, PM 2016, ‘Recognition and analysis 
of Tamil inscriptions and mapping using image processing techniques’. 
In Science Technology Engineering and Management (ICONSTEM), 
Second International Conference on IEEE, pp. 181-184. 


Jayanthi, N, Tomar, A, Raj, A, Indu, S & Chaudhury, S 2014, 
‘Digitization of historic inscription images using cumulants based 
simultaneous blind source extraction’. Proceedings of the Indian 
Conference on Computer Vision Graphics and Image Processing, 
ACM, p. 51. 


Jeniffer R Angelin & Bhuvaneswari, G 2014, ‘Image Glazing for 
Thinning of Ancient Tamil Characters’, International Journal of 
Scientific & Engineering Research, vol. 5, issue 6. 


Kacem Afef & Asma Saidani 2017, ‘A texture-based approach for 
word script and nature identification’. Pattern Analysis and 
Applications 20.4, pp. 1157-1167. 


Kan, C & Srinath, MD 2002, ‘Invariant character recognition with 
Zernike and orthogonal Fourier—Mellin moments’. Pattern recognition, 
vol. 35, no. 1, pp. 143-154. 


Kapur, JN, Sahoo, PK & Wong, AKC 1985, ‘A new method for gray- 
level picture thresholding using the entropy of the histogram’. Comput 
Vis Graph Image Process 29.3, pp. 273-285. 


Karunarathne, KGND, Liyanage, KV, Ruwanmini, DAS, Dias, GKA 
& Nandasara, ST 2017, ‘Recognizing ancient Sinhala Inscription 
Characters using Neural Network Technologies’. International Journal 
of Scientific Engineering and Applied Sciences, vol. 3. 


Kavallieratou, E & Stathis, S 2006, ‘Adaptive binarization of historical 
document images’. In: 18th international conference on pattern 
recognition, (ICPR), IEEE, vol. 3. 


Kavitha, AS, Shivakumara, P & Kumar, GH 2013, ‘Skewness and 
nearest neighbour-based approach for historical document 
classification’. In 2013 International Conference on Communication 
Systems and Network Technologies IEEE, pp. 602-606. 


eoaia ANNA UNIVERSITY, CHENNAI - 600 025 (4S) 


Al. 


42. 


43. 


44. 


45. 


46. 


47. 


48. 


49. 


50. 


156 


Kunchukuttan, A, Puduppully, R & Bhattacharyya, P 2015, Brahmi- 
Net: A transliteration and script conversion system for languages of the 
Indian subcontinent’. In Proceedings of the 2015 Conference of the 
North American Chapter of the Association for Computational 
Linguistics: Demonstrations, pp. 81-85. 


Kurniawan, F, Rahim, MSM, Daman, D, Rehman, A, Mohamad, D “% 
Mariyam, S 2011, ‘Region-based touched character segmentation in 
handwritten words’. Int J Innov Comput Inf Control, vol. 7, no. 6, pp. 
3107-3120. 


Lee H Cetal 2016, ‘An improved adaptive binarization algorithm based 
on fuzzy logic’. Int J Softw Eng Appl 10.10, pp. 1-8. 


Leutenegger, S, Chli, M & Siegwart, RY 2011, ‘BRISK: Binary robust 
invariant scalable key points. In Computer Vision (ICCV)’, 
International Conference on IEEE, pp. 2548-2555. 


Liao, PS, Chen, TS & Chung Pau-Choo 2001, ‘A fast algorithm for 
multilevel thresholding’. J Inf Sci Eng., vol. 17, no. 5, pp. 713-727. 


Liu, CL, Kim, IJ & Kim, JH 2001, ‘Model-based stroke extraction and 
matching for handwritten Chinese character recognition’. Pattern 
Recognition, vol. 34, no. 12, pp. 2339-2352. 


Louloudis, G, Gatos, B, Pratikakis, I & Halatsis, C 2009, ‘Text line 
and word segmentation of handwritten documents’. Pattern 
Recognition, vol. 42, no. 12, pp. 3169-3183. 


Lu, S, Su, B & Tan Chew Lim 2010, ‘Document image binarization 
using background estimation and stroke edges’. Int J Doc Anal 
Recognition 13.4, pp. 303-314. 


Mahalakshmi, M & Sharavanan, M 2013, ‘Ancient Tamil script and 
recognition and translation using LabVIEW’. In Communications and 
Signal Processing (ICCSP), 2013 International Conference on IEEE, 
pp. 1021-1026. 


Mangla, P & Kaur, H 2014, ‘An end detection algorithm for 
segmentation of broken and touching characters in handwritten 
Gurumukhi_ word’. In Reliability, Infocom Technologies and 
Optimization (ICRITO)(Trends and Future Directions), 2014 3rd 
International Conference on IEEE, pp. 1-4. 


eoaia ANNA UNIVERSITY, CHENNAI - 600 025 (4S) 


at, 


oe: 


53, 


54. 


oD: 


56, 


as 


58. 


59. 


157 


Manigandan, T, Vidhya, V, Dhanalakshmi, V & Nirmala, B 2017, 
‘Tamil character recognition from ancient epigraphical inscription 
using OCR and NLP’. In 2017 International Conference on Energy, 
Communication, Data Analytics and Soft Computing (ICECDS) IEEE, 
pp. 1008-1011. 


Manmatha, R & Rothfeder, JL 2005, ‘A scale space approach for 
automatically segmenting words from_ historical handwritten 
documents’, IEEE Transactions on Pattern Analysis and Machine 
Intelligence, vol. 27, no. 8, pp. 1212-1225. 


Mari, SS & Raju, G 2015, ‘Modified view-based approaches for 
handwritten Tamil character recognition’ . ICTACT Journal on Image 
& Video Processing, vol. 6, no. 1. 


Mei, Y, Wang, X & Wang, J 2013, ‘A Chinese character segmentation 
algorithm for complicated printed documents’. International Journal of 
Signal Processing, Image Processing and Pattern Recognition, vol. 6, 
no. 3, pp. 91-100. 


Meng, L, Aravinda, CV, Reddy, KUK, Izumi, T & Yamazaki, K 2018, 
‘Ancient Asian Character Recognition for Literature Preservation and 
Understanding’. In Euro-Mediterranean Conference, Springer, Cham, 
pp. 741-751. 


Mohammad, F, Anarase, J, Shingote, M & Ghanwat, P 2014, ‘Optical 
character recognition implementation using pattern matching’. 
International Journal of Computer Science and _ Information 
Technologies, vol. 5, no. 2, pp. 2088-2090. 


Mohana, HS, Navya, K, Rajithkumar, BK & Nagesh, C, 2014, 
‘Interactive segmentation for character extraction in stone 
inscriptions’. In Current Trends in Engineering and Technology 
(ICCTET), 2nd International Conference on IEEE pp. 321-327... 


Mohana, HS, Pradeepa, R, Kammar, PN & Rajithkumar, BK 2015. 
‘Identification and recognition of ancient stone in-scripted Hoysala 
characters using support vector machine (SVM) model’. International 
Journal of Innovative Research and Technology, p. 12. 


Murthy, KS, Kumar, GH, Kumar, PS & Ranganath, PR, 2004, ‘Nearest 
neighbour clustering-based approach for line and _ character 
segmentation in epigraphical scripts’. In International Conference on 
Cognitive Systems, New Delhi, pp. 14-15. 


asta ANNA UNIVERSITY, CHENNAI - 600 025 (4S) 


60. 


61. 


62. 


63. 


64. 


65. 


66. 


67. 


68. 


69. 


70. 


71, 


158 


Nawaz, T, Naqvi, SAHS, Rehman, H & Faiz, A, 2009. ‘Optical 
character recognition system for urdu (naskh font) using pattern 
matching technique’. International Journal of Image Processing (IJIP), 
vol. 3 no. 3, p. 92. 


Nguyen, KC & Nakagawa, M, 2016, ‘Text-line and character 
segmentation for offline recognition of handwritten japanese 
text’. IEICE technical report, vol. 115 no. 517, pp. 53-58. 


Niblack, W, 1986, ‘Introduction to digital image processing’. Prentice 
Hall, Englewood Cliffs, pp. 115-116. 


Otsu, N, 1979, ‘A threshold selection method from gray-level 
histograms’. IEEE Trans Syst Man Cybern vol. 9 no. 1 pp. 62-66. 


Pal, U, Roy, PP, Tripathy, N & Llados, J, 2010, ‘Multi-oriented Bangla 
and Devnagari text recognition’. Pattern Recognition, vol. 43 no. 12, 
pp. 4124-4136. 


Pannirselvam, S & Ponmani, S, 2014, ‘A Novel Hybrid Model for 
Tamil Handwritten Character Segmentation’. International Journal of 
Scientific & Engineering Research, vol. 5 no. 11, pp. 271-275. 


Pillai, S & Vaiyapuri, 1956, ‘History of Tamil language and literature’. 


Prasad, JR, Kulkarni, UV & Prasad, RS, 2009, ‘Offline handwritten 
character recognition of Gujrati script using pattern matching’. In Anti- 
counterfeiting, Security, and Identification in Communication, ASID 
3rd IEEE International Conference on pp. 611-615. 


RajaKumar, S, & V SubbiahBharathi, 2012, ‘Eighth century Tamil 
consonants recognition from stone inscriptions’ Recent Trends In 
Information Technology (ICRTIT), IEEE International Conference. 


Raj, MAR & Abirami, S, 2012, ‘A survey on Tamil Handwritten 
Character Recognition’. 


Rajkumar, S & Bharathi, S 2012, ‘Ancient Tamil script recognition 
from stone inscriptions using slant removal method’. ICEEBE. 


Rajithkumar, BK, & HS Mohana 2014, ‘Template Matching Method 
for Recognition of Stone Inscripted Kannada Characters of Different 
Time Frames Based on Correlation Analysis’. International Journal of 
Electrical and Computer Engineering vol. 4 no. 5 pp. 794. 


asta ANNA UNIVERSITY, CHENNAI - 600 025 (4S) 


fee 


yes 


74. 


73s 


76. 


TI, 


78. 


79. 


80. 


81. 


82. 


159 


Rusiol, M, Aldavert, D, Toledo, R & Llads, J, 2015, ‘Efficient 
segmentation-free keyword spotting in _ historical document 
collections’. Pattern Recognition, vol. 4 no. 2, pp. 545-555. 


Ruwanmini, DAS, Liyanage, KV, Karunarathne, KGND, Dias, GKA 
& Nandasara, ST 2016, ‘An architecture for an inscription recognition 
system for sinhala epigraphy’. International Journal of esearch 
Granthaalayah, vol. 4, no. 12, p. 48-64. 


Roy, PP, Bhunia, AK, Das, A, Dey, P & Pal, U, 2016, ‘HMM-based 
Indic handwritten word recognition using zone segmentation’. Pattern 
Recognition, vol. 60, pp. 1057-1075. 


Ryu, J, Koo, HI & Cho, NI, 2015, ‘Word segmentation method for 
handwritten documents based on structured learning’. IEEE Signal 
Processing Letters, vol. 22 no. 8, pp. 1161-1165. 


Saba, T, Rehman, A & Sulong, G, 2011, ‘Cursive script segmentation 
with neural confidence’. Int J Innov Comput Inf Control, IJICIC, vol. 7 
no. 7, pp. 1-10. 


Saeed, K & Albakoor, M, 2009, ‘Region growing based segmentation 
algorithm for typewritten and handwritten text recognition’. Applied 
Soft Computing, vol. 9 no. 2, pp. 608-617. 


Sauvola J & Pietikinen M 2000, ‘Adaptive document image 
binarization’. Pattern Recogn vol. 33 no. 2 pp. 225-236. 


Schmid, C, Mohr, R & Bauckhage, C, 2000, ‘Evaluation of interest 
point detectors’. International Journal of computer vision, vol. 37, 
no. 2, pp. 151-172. 


Seethalakshmi, R, Sreeranjani, TR, Balachandar, T, Singh, A, Singh, 
M, Ratan, R & Kumar, S, 2005, ‘Optical character recognition for 
printed Tamil text using Unicode’. Journal of Zhejiang University- 
SCIENCE, vol. 6 no. 11, pp. 1297-1305. 


Sesadri U, Sankar B S, Nagaraju C 2015, ‘Fuzzy entropy based 
optimal thresholding technique for image enhancement’. Int J Soft 
Comput IJSC vol. 6 no. 2 pp. 17-26. 


Shanthi, N & Duraiswamy, K, 2010, ‘A novel SVM-based handwritten 


Tamil character recognition system’. Pattern Analysis and 
Applications, vol. 13 no. 2, pp. 173-180. 


oats ANNA UNIVERSITY, CHENNAI - 600 025 (4S) 


83. 


84. 


85. 


86. 


87. 


88. 


89. 


90. 


a1. 


o2, 


160 


Shaus A, Turkel E, Piasetzky E 2012, ‘Binarization of first temple 
period inscriptions: performance of existing algorithms and a new 
registration-based scheme’. International conference on frontiers in 
handwriting recognition (ICFHR). IEEE. 


Siddiqui, M, Siddiqi, I & Khurshid, K, 2018, ‘Feature Extraction for 
Cursive Language Document Images: Using Discrete Cosine 
Transform, Discrete Wavelet Transform and Gabor Filter? ACM. 
In Proceedings of the 2nd Mediterranean Conference on Pattern 
Recognition and Artificial Intelligence pp. 84-87. 


Singh, TR, Roy, S, Singh, OI, Sinam, T & Singh, K, 2012, ‘A new 
local adaptive thresholding technique in binarization’. arXiv preprint 
arXiv: 1201.5227. 


Soumya, A, & Kumar, GH, 2014, ‘Pre-processing of camera captured 
inscriptions and segmentation of handwritten Kannada text’. Int J Adv 
Res Comput Communication Engineering vol. 3 no.5 pp. 6794-6803. 


Sridevi, N & Subashini, P 2012, ‘Segmentation of Text Lines and 
Characters in Ancient Tamil Script Documents using Computational 
Intelligence Techniques’. International Journal of Computer 
Applications, vol. 52, no. 14. 


Su B, Lu S & Tan, CL 2013’ ‘Robust document image binarization 
technique for degraded document images’ . IEEE Trans Image Process 
vol. 22 no. 4 pp. 1408-1417. 


Suganya, TS, & Murugavali S, 2014, ‘Binarization of Ancient Tamil 
Scripts from Stone inscriptions’. Global journal for research analysis, 
vol. 3, no. 11: 72-74. 


Tamen, Z, Drias, H & Boughaci, D, 2017, ‘An efficient multiple 
classifier system for Arabic handwritten words recognition’. Pattern 
Recognition Letters, vol. 93, pp .123-132. 


Tan, J, Lai, JH, Wang, CD, Wang, WX & Zuo, XX, 2012, ‘A new 
handwritten character segmentation method based on_ nonlinear 
clustering. Neurocomputing, vol. 89, pp. 213-219. 


Tomasi C, & Manduchi R, 1998, ‘Bilateral filtering for gray and color 


images’. In: Proceedings of the IEEE international conference on 
computer vision, pp. 839-846. 


eoaia ANNA UNIVERSITY, CHENNAI - 600 025 (4S) 


93. 


94. 


95. 


96. 


oF; 


98. 


ee 


161 


Utpal Garain, Thierry Paquet & Laurent Heutte, 2005 ‘On Foreground- 
Background Separation in Low Quality Color Document Images’ In 
Eight International Conference on Document Analysis and 
Recognition. 


Vellingiriraj, EK, Balamurugan, M & Balasubramanie, P 2016, ‘Text 
Analysis and Information Retrieval of Historical Tamil Ancient 
Documents Using Machine Translation in Image Zoning’. International 
Journal of Languages, Literature and Linguistics, vol. 2, no. 4, 
pp. 164-168. 


Wahi, A, Sundaramurthy, S & Poovizhi, P, 2015, ‘Handwritten Tamil 
character recognition using Zernike moments and_ legendre 
polynomial’, In Artificial Intelligence and Evolutionary Algorithms in 
Engineering Systems, pp. 595-603. 


Xiao, X., Jin, L., Yang, Y., Yang, W., Sun, J. & Chang, T, 2017, 
‘Building fast and compact convolutional neural networks for offline 
handwritten Chinese character recognition’. Pattern Recognition, 
pp. 72-81. 


Yao, Zhenjie, & Weidong Yi, 2016, ‘Curvature aided Hough transform 
for circle detection’. Expert Systems with Applications vol. 51, 
pp. 26-33. 


Zheng, Liying, Abbas, H, Hassin & Xianglong Tang, 2004, ‘A new 
algorithm for machine printed Arabic character segmentation’ Pattern 
Recognition Letters vol. 25, no. 15, pp. 1723-1729. 


Zhou, SF, Liu, CP, Cui, ZM & Gong, SR, 2009, ‘An improved 


adaptive document image binarization method’. In Image and Signal 
Processing, IEEE 2nd International Conference, pp. 1-5. 


eoaia ANNA UNIVERSITY, CHENNAI - 600 025 (4S) 


162 


LIST OF PUBLICATIONS 


International Journals 


hk Durga Devi, K & Uma Maheswari, P 2018, ‘Digital acquisition and 
character extraction from stone inscription images using modified 
fuzzy entropy-based adaptive thresholding’. Soft Computing, Springer, 
vol. 23, no. 8, pp. 2611-2626. (Annexure 1, Impact Factor: 2.367) 


‘ead ANNA UNIVERSITY, CHENNAI - 600 025 (a 


A NOVEL APPROACH FOR DIGITIZATION OF 
ANCIENT STONE INSCRIPTIONS AND 
DEVELOPMENT OF MACHINE LEARNING 
MODELS FOR CHARACTER RECOGNITION 
AND TRANSLITERATION 


A THESIS 


Submitted by 


DURGA DEVI K 


in partial fulfillment of the requirements for the degree of 


DOCTOR OF PHILOSOPHY 


FACULTY OF INFORMATION AND 
COMMUNICATION ENGINEERING 
ANNA UNIVERSITY 
CHENNAI 600 025 


NOVEMBER 2019 


145 


CHAPTER 8 


CONCLUSION AND FUTURE WORK 


8.1 CONCLUSION 


Image processing and pattern recognition techniques have made 
great strides in recent times. As far as stone inscriptions are concerned, 
however, the method of acquisition used by the Department of Archeology is 
traditional, laborious and time- consuming. The manual work of translation is 
done by epigraphers, and these manually-acquired inscriptions are preserved 
only in a degradable paper format. A huge mass of inscriptions is available, as 
yet unnoticed by the Department of Archeology. A digital acquisition of stone 
inscriptions will resolve the issue of the manual work still in progress, greatly 
reducing the time taken for the process, from as much as a few years to as 
little as a month. Such a digital recognition system is essential on account of 
the rapid deterioration and loss of content from stone inscription images. This 
research has focused on state-of-the-art approaches in digital acquisition, and 
on the automated transliteration of characters found in stone inscription 


images. 


A brief study of the literature on the subject makes it clear that 
work on character recognition has only focused on clear document images 
and scanned handwritten images. There is very little research on recognizing 
characters from real- time images. Intensive research in the field of OCR is 
chiefly driven by the challenge it presents in simulating human reading, and 


also because it provides efficient applications such as the automatic 
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processing of bulk papers, transferring data to machines, and web interfaces 
for paper documents. To have machines replicate human functions and 


perform common tasks like reading is a dream come true. 


This research has dealt with the complexities involved in separating 
foreground characters from the background. There are difficulties involved in 
dealing with input images, especially in terms of complex variations in 
character shapes and deformities brought on by deterioration. Handling 
document images is far easier than handling stone inscription images, given 
that the latter revel a lack of continuity in inscription character. Hence, an 
efficient binarization technique has been proposed to extract characters from 


stone inscriptions. 


The next module of this research has resolved the issue of 
segmenting complete scripts into individual characters. the complete scripts 
into characters. To this end, an adaptive thresholding technique has been 
proposed, based on a histogram plot, that helps segment lines and characters. 
In the subsequent stage, machine learning models have been developed to 
identify unique character features that help recognize and classify characters. 
Both the pattern matching algorithm and ANN produced a lower recognition 
rate and a higher _misclassification rate. Consequently, the deep learning 
model, CNN, has been proposed to train a large number of characters and 


recognize inscription characters by achieving a higher accuracy rate. 


The results show that the techniques used in the study have 
demonstrated a good accuracy rate. The work will help epigraphists and 
researchers access and interpret the information and knowledge retrieved 
from ancient stone inscription images. Further, it will keep the general public 
interested in the contents of inscriptions, now accessible in the contents of 


inscriptions, now accessible in a user-friendly digital format. In addition, this 
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system will be of use to the Department of Archeology as it greatly reduces 


the time taken for manual translation. 
8.2 FUTURE WORK 


From this study and the results obtained, future directions can 


include the following: 


e Since the proposed enhancement technique works for clear 
inscription images, future work can focus on reconstructing 


deteriorated characters. 


e The proposed work has concentrated only on 11” century 
inscriptions, on account of the huge volume of characters 
handled, stemming from the proliferation of work done during 
the era. In the future, inscriptions from other centuries can be 


handled as well. 


e The accurate prediction of compound characters and vowels can 
be improved by training the deep learning model on a much 


larger character dataset. 


e This system transliterates characters as such. Future work can 
include the grouping of characters into word so as to form 


meaningful sentences. 
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